In research the horizon recedes as we advance, and is no nearer at sixty than it was at twenty. As the power of endurance weakens with age, the urgency of the pursuit grows more intense ... And research is always incomplete.
— Mark Pattison (Isaac Casaubon)
As individuals, we all have slightly different genomes. If you compare the genomes of two people, you will find about 3 million base pair differences, which is about 0.1% of the genome.
This variation exists not only within the population but potentially also, to a lesser extent, among our cells, which number around 40 trillion. That's roughly 10,000 cells for each base in your 3 billion base genome. And each has a role to play.
| POG cases, by tissue type | |||
|---|---|---|---|
| n | % | ||
| Gastrointestinal ● | 141 | 25 | |
| Breast ● | 138 | 25 | |
| Thoracic ● | 57 | 10 | |
| Gynecologic ● | 45 | 8.3 | |
| Soft tissue ● | 44 | 8.1 | |
| Skin ● | 11 | 2.0 | |
| Urologic ● | 8 | 1.5 | |
| Hematologic ● | 7 | 1.3 | |
| Head and neck ● | 6 | 1.1 | |
| Endocrine ● | 5 | 0.9 | |
| Central nervous system ● | 5 | 0.9 | |
| Other ● | 78 | 14 | |
| ALL | 545 | ||
One consequence of this complexity and variation is that changes in the genome (through mutation or other processes) can have very different effects, depending on both the change and the genome. Cancer is a phenomena in which cells' ability to organize themselves as they divide is altered due to changes in the genome. It is an incredibly complex biological phenomenon—considering all the genomes in the population and all the possible changes that may arise, there is truly an inexhaustible number of ways in which the genome can break.
Cancers are classified according to their site of origin, such as lung, breast, liver, or colon. This is a coarse grouping—within each group there are many subtypes with differences in response to treatment and overall behaviour.
The design of the POG art highlights the diversity and similarity among cases. The diversity is what makes the study of cancer difficult and the similarities are what makes inference possible.
Each case is represented by three concentric rings. The width of each ring represents the extent to which the case is similar (as measured by correlation) to cancers of the type encoded by the color of the ring (see Methods).
In additional to the posters, I've created remixes for your desktop at 4k resolution.
This year, the cyclists in the Ride to Conquer Cancer will not only have the chance to raise money for research (as they've always done) but also do so while wearing data (as they've never done before).
You can purchase your own data-powered and human-driven cycling jersey.
My cover design on the 7 April 2026 Nature Biotechnology issue shows the dendrogram that represents a cluster of uniquely expressed (or downregulated) genes in human naive stem cells induced from such cells. Within each dendrogram block, the genomic barcode sequence (sampled from Supplementary Table 1) is depicted with a Code 39 barcode. The highlighted barcode is one of those used for cell isolation.
Ishiguro S. et al. A multi-kingdom genetic barcoding system for precise clone isolation (2026) Nature Biotechnology 44:616–629.
Browse my gallery of cover designs.
Celebrate π Day (March 14th) and enjoy the art — but only if you're part of the 5%.
Go ahead, see what you can't see.
Authentic and accurate images of Ishihara's test plates photographed (and lovingly color-corrected) from the 38-plate Ishihara's Tests for Colour Deficiency.
I also provide the position, size, and color of each circle on each test plate.
What immortal hand or eye, could frame thy fearful symmetry? — William Blake, "The Tyger"
This month, we look at symmetric regression, which, unlike simple linear regression, it is reversible — remaining unaltered when the variables are swapped.
Simple linear regression can summarize the linear relationship between two variables `X` and `Y` — for example, when `Y` is considered the response (dependent) and `X` the predictor (independent) variable.
However, there are times when we are not interested (or able) to distinguish between dependent and independent variables — either because they have the same importance or the same role. This is where symmetric regression can help.
Luca Greco, George Luta, Martin Krzywinski & Naomi Altman (2025) Points of significance: Symmetric alternatives to the ordinary least squares regression. Nat. Methods 22:1610–1612.
Fuelled by philanthropy, findings into the workings of BRCA1 and BRCA2 genes have led to groundbreaking research and lifesaving innovations to care for families facing cancer.
This set of 100 one-of-a-kind prints explore the structure of these genes. Each artwork is unique — if you put them all together, you get the full sequence of the BRCA1 and BRCA2 proteins.
The needs of the many outweigh the needs of the few. —Mr. Spock (Star Trek II)
This month, we explore a related and powerful technique to address bias: propensity score weighting (PSW), which applies weights to each subject instead of matching (or discarding) them.
Kurz, C.F., Krzywinski, M. & Altman, N. (2025) Points of significance: Propensity score weighting. Nat. Methods 22:638–640.