August 31, 2015

The Bayesian Reproducibility Project→

August 31, 2015 / Jonathan

Alexander Etz on why we need a better metric for "success" in reproducibility.

Based on these two metrics, the headlines are accurate: Over half of the replications “failed”. But these two reproducibility metrics are either invalid (comparing significance levels across experiments) or very vague (confidence interval agreement). They also only offer binary answers: A replication either “succeeds” or “fails”, and this binary thinking leads to absurd conclusions in some cases like those mentioned above. Is replicability really so black and white? I will explain below how I think we should measure replicability in a Bayesian way, with a continuous measure that can find reasonable answers with replication effects near zero with wide CIs, effects near the original with tight CIs, effects near zero with tight CIs, replication effects that go in the opposite direction, and anything in between.

Jonathan Peelle, PhD
Professor
CV: PeelleCV.pdf
Google Scholar profile
Lab: peellelab.org
E-mail: j.peelle123@northeastern.edu (without the 123)

("Peelle" sounds like "peel". I blame my ancestors for the extra letters.)
