data visualization + art

Creating the Molecular Case Studies Cover

If your photos aren’t good enough, then you’re not close enough
— Robert Capa

Molecualar Case Studies - Creating the April 2018 cover / Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca

▲ Cover design for Apr 2018 issue of Molecular Case Studies. (zoom)

about the cover

Papillary thyroid carcinoma (PTC) cells, even though malignant, are still genetically programmed to try to be thyroid follicles and may retain their follicular growth pattern, which appear as circles on cross section. Two diagnostic features of papillary thyroid carcinoma are nuclear clearing and intranuclear cytoplasmic inclusions. The black-and-white image is an artistic treatment of a PTC microscopy image (40×) from one of the Personalized Oncogenomics Program study participants at the BC Cancer Research Center. Superimposed is a Circos plot of 17 genomic fusions involving 17 chromosomes identified in the sample by whole-genome sequencing. Showing through the Circos plot is an enhanced color version of the microscopy image. The original image is from Application of genomics to identify therapeutic targets in recurrent pediatric papillary thyroid carcinoma by Ronsley et al. in the April 2018 issue.

The theme of the April issue of Molecular Case Studies is precision oncogenomics. We have three papers in the issue based on work done in our Personalized Oncogenomics Program (POG).

...this special issue provide[s] a glimpse into current cancer precision medicine efforts, reflecting only a microcosm of ... genomics in this bustling space of clinical translation.
John C. Carpten & Elaine R. Mardis
The era of precision oncogenomics
Mol. Case Stud. (2018) 4(2).

I've previously created art based on POG data—posters to celebrate the program's 5-year anniversary.

input materials

The covers of Molecular Case Studies typically show microscopy images, with some shown in a more abstract fashion. There's also the occasional Circos plot.

I've previously taken a more fine-art approach to cover design, such for those of Nature, Genome Research and Trends in Genetics. I've used microscopy images to create a cover for PNAS—the one that made biology look like astrophysics—and thought that this is kind of material I'd start with for the MCS cover.

▲ A few of the microscopy slides submitted to me for the cover design. Courtesy of Anna Lee (Dept Pathology and Laboratory Medicine, UBC).

When I look at these kind of images, I have basically no idea what I'm looking at. Sure, I know this is life at tiny scale but I am not a pathologist. This helps me greatly.

Instead, I see color, shapes, and contrast. I hunt for patterns that would make for an interesting visual, without necessarily trying to communicate any of the science behind that—the paper does a much better job at this than I ever could. It's largely a process driven by intuition and my desire to see distinct visual patterns at different length scales with some symmetry, ideally broken in a pleasing way. Vague, I know.

Images of different regions of the same slide, at the same magnification, can have very different levels of visual engagement (for the non-specialist). Just compare the two images below.

▲ Same magnification, same slide. The image on the right is interesting. The one on the left is not, artistically speaking. Courtesy of Anna Lee (Dept Pathology and Laboratory Medicine, UBC).

The slide on the left really caught my eye. It had the right proportion of tiny, small, medium and large things.

▲ I see a heart, a face and lava flow among faces. Obviously, faces everywhere—humans are good at those kinds of Type I errors. The panels below are 100% crops of the 40× slide of papillary thyroid carcinoma shown above them. Courtesy of Anna Lee (Dept Pathology and Laboratory Medicine, UBC).

black and white version

The black-and-white version was obtained by solarizing the image. There are both color and black-and-white options for solarization, a method in which various tones of the image are remapped in brightness.

▲ (A) The original slide (B) A black-and-white composition using Nik Color Efex 4 filters applied in succession to the slide: dark contrast, tonal contrast, white neutralizer and solarization.

▲ Compounding effects of each of the filters on the image above.

And here's the first black-and-white take.

▲ The initial black-and-white composition after applying Nik filters.

This looked good but a bit dark. I handled this by lightening the tone, differently depending on the element in the image. I also wanted to bring out more details in the internal structure of the cells. This was achieved by applying an otherwise aggressive sharpening mask.

▲ The effect of additional sharpening and tone remaps, applied differently to intracellular and extracellular regions.

I was quite happy with this result. The combination of solarization and sharpening created a large variety of patterns inside the cells. My brain fought hard to see faces in them.

▲ 100% crops of regions of the above black-and-white image. I see a heart (this is the same heart region shown in the color crop above), then a some kind of dog/cat chimera, in the last panel, a suprrised or scared camel. If you look very carefully, you can see a grumpy cat coming out of the heart.

Because I had slides at different magnifications, I created a design in which three slides at 10, 20 and 40 × were composited together so that from left to right the magnification increased across the image. The effect is subtle—you can easily miss it, which is the point.

▲ A seamless stitch of black-and-white treatments of 10, 20 and 40 × slides. As you go from left to right, the magnification increases.

I had pretty high hopes for these black-and-white versions. Previous covers in MCS have been colorful, though, so I thought to provide a color option.

color version

For the color version, I wanted to give the colors more punch. For sure.

I also wanted to emphasize the details, like for the black-and-white image.

The first process step of the color slide was done using 5 Nik filters, applied in succession: dark contrast, tonal contrast, sunlight, polarization and detail extractor. The effects of the stack of these filters is shown on the original image below. The whole image is shown and in each strip the filters are stacked.

▲ The effect of stacking 5 Nik filters on the original image.

Here's the full image with the 5 Nik filters applied.

▲ A seamless stitch of black-and-white treatments of 10, 20 and 40 × slides. As you go from left to right, the magnification increases.

Not there yet, though. I added more sharpening (more than I've ever used before, so I felt a little weird, but got over it quickly). The colors were punched up too—I wanted more contrast between the blue and red areas and transform the reds a little into oranges.

▲ A seamless stitch of black-and-white treatments of 10, 20 and 40 × slides. As you go from left to right, the magnification increases.

If it looks like the blue areas are popping out of the image, that's the effect of the emboss filter.

final composition

The editors asked me to encorporate a Circos image in the final design. This was tricky—I had spent a lot of time up to now fiddling with extracting patterns and textures from the images.

Something as geometrical and rational as a data graphic would alter the personality of the design. But, the goal of artistic collaboration is always to find a way, so I took some gene fusions that were found in the sample with our structural variant pipeline and created a bare-bones Circos image out of them.

▲ A seamless stitch of black-and-white treatments of 10, 20 and 40 × slides. As you go from left to right, the magnification increases.

This was then superimposed on the image and emphasized by using the color design inside the circle and black-and-white design outside.

▲ The final composition for the cover combines both black-and-white and color treatments. The colored pattern stands out above the black-and-white background.

It's always fun to invert images and see what happens.

▲ Inverse of the above. Notice how the pattern inside the circle appears to be sitting below the plane, making the circle more of a window to a scene. .

VIEW ALL

news + thoughts

Propensity score matching

Mon 16-09-2024

I don’t have good luck in the match points. —Rafael Nadal, Spanish tennis player

In many experimental designs, we need to keep in mind the possibility of confounding variables, which may give rise to bias in the estimate of the treatment effect.

Martin Krzywinski @MKrzywinski mkweb.bcgsc.ca

▲ Nature Methods Points of Significance column: Propensity score matching. (read)

If the control and experimental groups aren't matched (or, roughly, similar enough), this bias can arise.

Sometimes this can be dealt with by randomizing, which on average can balance this effect out. When randomization is not possible, propensity score matching is an excellent strategy to match control and experimental groups.

Kurz, C.F., Krzywinski, M. & Altman, N. (2024) Points of significance: Propensity score matching. Nat. Methods 21:1770–1772.

Nasa to send our human genome discs to the Moon

Sat 23-03-2024

We'd like to say a ‘cosmic hello’: mathematics, culture, palaeontology, art and science, and ... human genomes.

▲ SANCTUARY PROJECT | A cosmic hello of art, science, and genomes. (details)

▲ SANCTUARY PROJECT | Benoit Faiveley, founder of the Sanctuary project gives the Sanctuary disc a visual check at CEA LeQ Grenoble (image: Vincent Thomas). (details)

▲ SANCTUARY PROJECT | Sanctuary team examines the Life disc at INRIA Paris Saclay (image: Benedict Redgrove) (details)

Comparing classifier performance with baselines

Fri 22-03-2024

All animals are equal, but some animals are more equal than others. —George Orwell

This month, we will illustrate the importance of establishing a baseline performance level.

Baselines are typically generated independently for each dataset using very simple models. Their role is to set the minimum level of acceptable performance and help with comparing relative improvements in performance of other models.

▲ Nature Methods Points of Significance column: Comparing classifier performance with baselines. (read)

Unfortunately, baselines are often overlooked and, in the presence of a class imbalance, must be established with care.

Megahed, F.M, Chen, Y-J., Jones-Farmer, A., Rigdon, S.E., Krzywinski, M. & Altman, N. (2024) Points of significance: Comparing classifier performance with baselines. Nat. Methods 21:546–548.

Happy 2024 π Day—
sunflowers ho!

Sat 09-03-2024

Celebrate π Day (March 14th) and dig into the digit garden. Let's grow something.

▲ 2024 π DAY | A garden of 1,000 digits of π. (details)

How Analyzing Cosmic Nothing Might Explain Everything

Thu 18-01-2024

Huge empty areas of the universe called voids could help solve the greatest mysteries in the cosmos.

My graphic accompanying How Analyzing Cosmic Nothing Might Explain Everything in the January 2024 issue of Scientific American depicts the entire Universe in a two-page spread — full of nothing.

▲ How Analyzing Cosmic Nothing Might Explain Everything. Text by Michael Lemonick (editor), art direction by Jen Christiansen (Senior Graphics Editor), source: SDSS

The graphic uses the latest data from SDSS 12 and is an update to my Superclusters and Voids poster.

Michael Lemonick (editor) explains on the graphic:

“Regions of relatively empty space called cosmic voids are everywhere in the universe, and scientists believe studying their size, shape and spread across the cosmos could help them understand dark matter, dark energy and other big mysteries.

To use voids in this way, astronomers must map these regions in detail—a project that is just beginning.

Shown here are voids discovered by the Sloan Digital Sky Survey (SDSS), along with a selection of 16 previously named voids. Scientists expect voids to be evenly distributed throughout space—the lack of voids in some regions on the globe simply reﬂects SDSS’s sky coverage.”

voids

Sofia Contarini, Alice Pisani, Nico Hamaus, Federico Marulli Lauro Moscardini & Marco Baldi (2023) Cosmological Constraints from the BOSS DR12 Void Size Function Astrophysical Journal 953:46.

Nico Hamaus, Alice Pisani, Jin-Ah Choi, Guilhem Lavaux, Benjamin D. Wandelt & Jochen Weller (2020) Journal of Cosmology and Astroparticle Physics 2020:023.

Sloan Digital Sky Survey Data Release 12

constellation figures

Alan MacRobert (Sky & Telescope), Paulina Rowicka/Martin Krzywinski (revisions & Microscopium)

stars

Hoffleit & Warren Jr. (1991) The Bright Star Catalog, 5th Revised Edition (Preliminary Version).

cosmology

H₀ = 67.4 km/(Mpc·s), Ω_m = 0.315, Ω_v = 0.685. Planck collaboration Planck 2018 results. VI. Cosmological parameters (2018).

Error in predictor variables

Tue 02-01-2024

It is the mark of an educated mind to rest satisfied with the degree of precision that the nature of the subject admits and not to seek exactness where only an approximation is possible. —Aristotle

In regression, the predictors are (typically) assumed to have known values that are measured without error.

Practically, however, predictors are often measured with error. This has a profound (but predictable) effect on the estimates of relationships among variables – the so-called “error in variables” problem.

▲ Nature Methods Points of Significance column: Error in predictor variables. (read)

Error in measuring the predictors is often ignored. In this column, we discuss when ignoring this error is harmless and when it can lead to large bias that can leads us to miss important effects.

Altman, N. & Krzywinski, M. (2024) Points of significance: Error in predictor variables. Nat. Methods 21:4–6.

Background reading

Altman, N. & Krzywinski, M. (2015) Points of significance: Simple linear regression. Nat. Methods 12:999–1000.

Lever, J., Krzywinski, M. & Altman, N. (2016) Points of significance: Logistic regression. Nat. Methods 13:541–542 (2016).

Das, K., Krzywinski, M. & Altman, N. (2019) Points of significance: Quantile regression. Nat. Methods 16:451–452.

data visualization + art

Creating the Molecular Case Studies Cover

about the cover

input materials

black and white version

color version

final composition

Propensity score matching

Nasa to send our human genome discs to the Moon

Comparing classifier performance with baselines

Happy 2024 π Day—sunflowers ho!

How Analyzing Cosmic Nothing Might Explain Everything

voids

constellation figures

stars

cosmology

Error in predictor variables

Background reading

Happy 2024 π Day—
sunflowers ho!