Martin Krzywinski is a staff scientist at Canada’s Michael Smith Genome Sciences Centre.
Naomi Altman is a Professor of Statistics at The Pennsylvania State University.
Paul Blainey is an Assistant Professor of Biological Engineering at MIT and Core Member of the Broad Institute.
Danilo Bzdok is an Assistant Professor at the Department of Psychiatry, RWTH Aachen University, Germany, and a Visiting Professor at INRIA/Neurospin Saclay in France.
Kiranmoy Das is a faculty member at the Indian Statistical Institute in Kolkata, India.
Luca Greco is an Assistant Professor of Statistics at the University of Sannio in Benevento, Italy.
Jasleen Grewal is a graduate student in the Jones lab at Canada's Michael Smith Genome Sciences Centre.
Anthony Kulesa is a graduate student in the Department of Biological Engineering at MIT.
Jake Lever is a Postdoctoral Research Fellow in Bioengineering at Stanford University in Stanford, California, USA.
Geroge Luta Associate Professor of Biostatistics at the Georgetown University in Washington, DC, USA.
Jorge López Puga is a Professor of Research Methodology at UCAM Universidad Católica de Murcia.
Byran Smucker is an Associate Professor of Statistics at Miami University in Oxford, OH, USA.
Bernhard Voelkl is a Postdoctoral Research Fellow in the Division of Animal Welfare at the Veterinary Public Health Institute, University of Bern, Bern, Switzerland
Hanno Würbel is a Professor in the Division of Animal Welfare at the Veterinary Public Health Institute, University of Bern, Bern, Switzerland
I don’t have good luck in the match points. —Rafael Nadal, Spanish tennis player
In many experimental designs, we need to keep in mind the possibility of confounding variables, which may give rise to bias in the estimate of the treatment effect.
If the control and experimental groups aren't matched (or, roughly, similar enough), this bias can arise.
Sometimes this can be dealt with by randomizing, which on average can balance this effect out. When randomization is not possible, propensity score matching is an excellent strategy to match control and experimental groups.
Kurz, C.F., Krzywinski, M. & Altman, N. (2024) Points of significance: Propensity score matching. Nat. Methods 21:1770–1772.
We'd like to say a ‘cosmic hello’: mathematics, culture, palaeontology, art and science, and ... human genomes.
All animals are equal, but some animals are more equal than others. —George Orwell
This month, we will illustrate the importance of establishing a baseline performance level.
Baselines are typically generated independently for each dataset using very simple models. Their role is to set the minimum level of acceptable performance and help with comparing relative improvements in performance of other models.
Unfortunately, baselines are often overlooked and, in the presence of a class imbalance, must be established with care.
Megahed, F.M, Chen, Y-J., Jones-Farmer, A., Rigdon, S.E., Krzywinski, M. & Altman, N. (2024) Points of significance: Comparing classifier performance with baselines. Nat. Methods 21:546–548.
Celebrate π Day (March 14th) and dig into the digit garden. Let's grow something.