2024 π Daylatest newsbuy art
Thoughts rearrange, familiar now strange.Holly Golightly & The Greenhornes break flowersmore quotes
data visualization + public health

The COVID Charts

Observations on data visualizations of the coronavirus outbreak

The COVID Charts are brief critiques of data visualization and science communication of the coronavirus outbreak. They are not statements about the underlying science or public health policy.

If you would like me to critique a specific chart, get in touch.

The COVID Charts -- Observations on data visualizations of the coronavirus outbreak -- Martin Krzywinski

Inaccurate, sloppy and illegible . A slide from the presentation that explains the goals of community mitigation by comparing the time progress of daily cases with and without intervention. The chart projects that by “flattening the curve” deaths can be reduced from 1.5–2.2 million to 100,000–240,000. (White House Coronavirus Task Force, 31 March 2020).

31 March 2020
background
[https://en.wikipedia.org/wiki/White_House_Coronavirus_Task_Force](The Coronavirus Taskforce) uses press briefings to communicate updates, guidelines, and policy changes to the public during the 2020 COVID-19 pandemic in the US.
core message
On charts that are illustrative, shapes, trends and proportions should be reflected as accurately as possible. This is easy to get wrong when not using curves from the source models but rather attempting to depict the data qualitatively.
key guidelines
1. Use an appropriate data generation model even for illustrative graphics.
2. Never deceive the reader by drawing proportions that don't accurately reflect your message.
3. Avoid textures and other chart junk.

qualitatively inaccurate

The chart shows a normal distribution but epidemiological curves are typically asymmeric. This can be seen in a comparison of the infected fraction from a SIR model with R_0 = 2 model, scaled to have the same maximum.

Figure 1

Do not use a normal distribution to illustrate a data generation process that isn't normal. In this case, a simple SIR model curve is easy to generate and has the benefits authentically reflecting the change in shape as R__0 changes.

If correct shapes aren’t shown, important trends can be missed: as parameters vary, asymmetries or a changes in shape can arise that hint at the underlying mechanisms.

quantitatively inaccurate

More importantly, the relative size of curves does not reflect the actual numbers on the chart. The smaller curve has an area that is about 30% of the larger curve. As shown, it therefore represents 500,000 to 660,000 deaths. Always use a quantitative model even if the design calls for an illustrative comparison.

Figure 2

Accurately demonstrate proportions — failing to do so will give the reader an incorrect impression of how the data change. The curves on the plot imply a scenario in which cases are reduced to 11% but their actual shape corresponds to a reduction to 30%. This is akin to drawing two lines, one half the length of the other, and saying that it is actually one-sixth the length.

In order for the smaller curve to reflect the number of deaths shown on the slide, its area would have to be about 11% of the large curve. Without the correct scaling, it is easy to underestimate the degree of mitigation required for the depicted drops in deaths.

sloppy

The ranges of deaths labelling the two curves are formatted differently:

1,500,000 - 2,200,000

100-240,000

In both cases, a dash is used as a separator — for ranges, always use the n-dash (–) whose length helps separate the numbers.

illegible

Never use textures for fill. Textures disappear and shimmer and are “chart junk”.

For curves filled with color, apply transparency or color blend to emphasize their overlap. If the curve in the back is darker, superimpose a lightly tinted copy of it on top of the foreground (e.g. 15% opacity). This may achieve the desired effect if you don’t have access to blend modes.

Figure 3

Avoid textures at all cost. They are chart junk. Use color blends or a light tint overlay to mix the curve colors where they overlap. A subtle way to emphasize overlapping curves is to apply a thin white stroke to the top-most curve.

Martin Krzywinski | contact | Canada's Michael Smith Genome Sciences CentreBC Cancer Research CenterBC CancerPHSA
Google whack “vicissitudinal corporealization”
{ 10.9.234.152 }