2026 π Day latest news buy art
Safe, fallen down this way, I want to be just what I am.Cocteau Twinssafe at lastmore quotes
very clickable
visualization + design

Genome Informatics 2010 cover

Genome Informatics, September 15-19, 2010 / Hinxton, UK

1 · The conference program cover

The program cover shows sequences of some of the genes and viruses that appear in the 2010 Genome Informatics conference's abstracts.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
GENOME INFORMATICS 2010 FRONT COVER | The conference program cover shows sequences of some of the proteins and genes reported in the abstracts drawn as paths
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
GENOME INFORMATICS 2010 BACK COVER | The conference program cover shows sequences of some of the proteins and genes reported in the abstracts drawn as paths

The booklet was published with a black cover background. Below is an inverted and pinkish take on the cover.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
GENOME INFORMATICS 2010 FRONT AND BACK COVER | The conference program cover shows sequences of some of the proteins and genes reported in the abstracts drawn as paths

2 · Design of the cover

2.1 · Sequence as a path

Each sequence is represented by a continuous path. The length of the path is proportional to the length of the sequence.

2.2 · Path color — GC Content

At each point on the path, color is used to show the GC content computed over a window of 20 bases at that position.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
GC CONTENT ENCODING | GC content is encoded by color

Because the GC content doesn't vary greatly, values in the range 0.2–0.6 are mapped onto hues 0–300, with GC values outside that range assigned to the start and end hues. To smooth the color mpaping, a running average is calculated across 10 adjacent samples.

2.3 · Path direction — relative GC content

Direction of the curvature of the path is determined by the GC content relative to the average GC content of the human genome.

2.4 · Path curvature — Repeat content

The magnitude of path curvature is informed by the repeat content near that location, which is calculated by determining the average frequency of 10-mers sampled within a window of 200 bases relative to their frequency in the human exon sequence.

This quantity is expressed relative to the chance of observing these 10-mers randomly and used to inform the angle of the path. Regions that are composed of 10-mers that are relatively rare are straighter than those which contain repetitive regions.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
CURVATURE SHOWS REPEATS | The degree to which the path turns is informed by how much of the sequence at that position is repeated.

The path is confined within a circular area to keep it compact, at the cost of losing translational and rotational invariance of the representation. This limitation is due to the fact that the segments of the path depend on the angle and position at which the path approaches the circular boundary.

2.5 · Interpreting structure

For genes, the transcribed sequence is shown, which includes both introns and exons.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
GENES ARE HIGH-INFORMATION AREAS | Areas of high information are more straight (fewer repeats). Where sequence for areas outside genes and in repeats tend to curl up on themselves.

The overall effect of the path encoding is a qualitative, artistic interpretation of local sequence structure. Two paths can be directly compared to interrogate differences in their corresponding sequence.

3 · Deadly genome series

The Deadly Genomes poster demonstrates how entire genomes appear when encoded as paths. The poster compares the incidence rates and mortality of harmful viruses and bacteria, such as malaria, syphilis, AIDS and SARS.

Discover all the things that are not trying to make you stronger.
The cover design uses the same approach to depicting genomes as the Deadly Genomes poster.

As on the conference covers, on the poster each genome is drawn as a path. The length of the path is proportional to the size of the genome. Every fifth base is drawn as a circle whose color is based on the GC content (fraction of guanines and cytosines). The path curvature is proportional to the repeat content and the direction of curvature is determined by whether the GC content is lower or higher than average. Genomes are labeled by disease, organism, size (in bases) and GC content. Updated with the genome of SARS-CoV-2 (Wuhan-Hu-1 isolate) and COVID-19 case statistics as of 3 March 2020."

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
DEADLY GENOMES | Genomes of harmful bacteria and viruses.

The poster was a finalist in the 2009 National Science Foundation Visualization Challenge.

news + thoughts

Nature Biotechnology cover

Thu 23-04-2026

My cover design on the 7 April 2026 Nature Biotechnology issue shows the dendrogram that represents a cluster of uniquely expressed (or downregulated) genes in human naive stem cells induced from such cells. Within each dendrogram block, the genomic barcode sequence (sampled from Supplementary Table 1) is depicted with a Code 39 barcode. The highlighted barcode is one of those used for cell isolation.

Ishiguro S. et al. A multi-kingdom genetic barcoding system for precise clone isolation (2026) Nature Biotechnology 44:616–629.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
My Nature Biotechnology phylogenetic tree cover (volume 44, issue 4, 7 April 2026). (more)

Browse my gallery of cover designs.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
A catalogue of my journal and magazine cover designs. (more)

Happy 2026 π Day—
Art for the 5%

Fri 13-03-2026

Celebrate π Day (March 14th) and enjoy the art — but only if you're part of the 5%.

Go ahead, see what you can't see.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
2026 π DAY | Art for the 5%. Shown in the style of Ishihara color test plates, the art is visible only to those with colour blindness. (details)

Ishihara's Tests for Colour Deficiency

Sun 08-03-2026

Authentic and accurate images of Ishihara's test plates photographed (and lovingly color-corrected) from the 38-plate Ishihara's Tests for Colour Deficiency.

I also provide the position, size, and color of each circle on each test plate.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
ISHIHARA'S TEST PLATE 6 | This plate is part of the set of transformation plates. If you see 5, you're ok. If you see 2, you're not. (details)
Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
ISHIHARA'S TEST PLATE 18 | This plate is part of the set of mysterious hidden plates. If you don't see anything, you're ok. If you see 5, you're not. (details)

Symmetric alternatives to the ordinary least squares regression

Wed 23-07-2025

What immortal hand or eye, could frame thy fearful symmetry? — William Blake, "The Tyger"

This month, we look at symmetric regression, which, unlike simple linear regression, it is reversible — remaining unaltered when the variables are swapped.

Simple linear regression can summarize the linear relationship between two variables `X` and `Y` — for example, when `Y` is considered the response (dependent) and `X` the predictor (independent) variable.

However, there are times when we are not interested (or able) to distinguish between dependent and independent variables — either because they have the same importance or the same role. This is where symmetric regression can help.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Nature Methods Points of Significance column: Symmetric alternatives to the ordinary least squares regression. Geometry of quantities minimized in OLS and symmetric regression. OLS minimizes `\Sigma e_y^2` in `Y` ~ `X` and `\Sigma e_x^2` `X` ~ `Y`. Pythagorean regression minimizes AB (magenta). Geometric means regression (GMR) minimizes area of ABP (orange). Orthogonal regression (OR) minimizes HP (blue). (read)

Luca Greco, George Luta, Martin Krzywinski & Naomi Altman (2025) Points of significance: Symmetric alternatives to the ordinary least squares regression. Nat. Methods 22:1610–1612.

Beyond Belief Campaign BRCA Art

Wed 11-06-2025

Fuelled by philanthropy, findings into the workings of BRCA1 and BRCA2 genes have led to groundbreaking research and lifesaving innovations to care for families facing cancer.

This set of 100 one-of-a-kind prints explore the structure of these genes. Each artwork is unique — if you put them all together, you get the full sequence of the BRCA1 and BRCA2 proteins.

Propensity score weighting

Mon 17-03-2025

The needs of the many outweigh the needs of the few. —Mr. Spock (Star Trek II)

This month, we explore a related and powerful technique to address bias: propensity score weighting (PSW), which applies weights to each subject instead of matching (or discarding) them.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca
Nature Methods Points of Significance column: Propensity score weighting. (read)

Kurz, C.F., Krzywinski, M. & Altman, N. (2025) Points of significance: Propensity score weighting. Nat. Methods 22:638–640.

Martin Krzywinski | contact | Canada's Michael Smith Genome Sciences CentrePHSA
Google whack “vicissitudinal corporealization”
{ 10.9.234.159 }