Predicting the replicability of social and behavioural science claims in COVID-19 preprints

Nat Hum Behav. 2024 Dec 20. doi: 10.1038/s41562-024-01961-1. Online ahead of print.

Abstract

Replications are important for assessing the reliability of published findings. However, they are costly, and it is infeasible to replicate everything. Accurate, fast, lower-cost alternatives such as eliciting predictions could accelerate assessment for rapid policy implementation in a crisis and help guide a more efficient allocation of scarce replication resources. We elicited judgements from participants on 100 claims from preprints about an emerging area of research (COVID-19 pandemic) using an interactive structured elicitation protocol, and we conducted 29 new high-powered replications. After interacting with their peers, participant groups with lower task expertise ('beginners') updated their estimates and confidence in their judgements significantly more than groups with greater task expertise ('experienced'). For experienced individuals, the average accuracy was 0.57 (95% CI: [0.53, 0.61]) after interaction, and they correctly classified 61% of claims; beginners' average accuracy was 0.58 (95% CI: [0.54, 0.62]), correctly classifying 69% of claims. The difference in accuracy between groups was not statistically significant and their judgements on the full set of claims were correlated (r(98) = 0.48, P < 0.001). These results suggest that both beginners and more-experienced participants using a structured process have some ability to make better-than-chance predictions about the reliability of 'fast science' under conditions of high uncertainty. However, given the importance of such assessments for making evidence-based critical decisions in a crisis, more research is required to understand who the right experts in forecasting replicability are and how their judgements ought to be elicited.
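For readers unfamiliar with the metrics quoted above, the short Python sketch below illustrates, on simulated placeholder data (not the study's data or analysis code), how a binary classification accuracy from thresholded probability judgements and a Pearson correlation between two groups' judgements might be computed. All variable names and values here are hypothetical and purely illustrative.

```python
import numpy as np
from scipy import stats

# Hypothetical stand-ins for elicited probability judgements (0-1) that a
# claim will replicate, and binary replication outcomes (1 = replicated).
rng = np.random.default_rng(0)
n_claims = 29                                   # claims with new replications
outcomes = rng.integers(0, 2, n_claims)
judgements = np.clip(0.5 + 0.2 * (outcomes - 0.5) + rng.normal(0, 0.15, n_claims), 0, 1)

# Classification accuracy: share of claims whose outcome is predicted
# correctly when judgements are thresholded at 0.5.
correct = (judgements >= 0.5) == (outcomes == 1)
print("classification accuracy:", correct.mean())

# Pearson correlation between two groups' judgements on the same claims,
# analogous in form to the r(98) = 0.48 reported for the 100 claims.
group_a = judgements
group_b = np.clip(judgements + rng.normal(0, 0.1, n_claims), 0, 1)
r, p = stats.pearsonr(group_a, group_b)
print(f"r = {r:.2f}, p = {p:.3g}")
```

This sketch only unpacks what the two kinds of numbers mean; the study's actual accuracy scoring and elicitation protocol are described in the full paper.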