Implementation of the COVID-19 Vulnerability Index Across an International Network of Health Care Data Sets: Collaborative External Validation Study

Jenna M Reps; Chungsoo Kim; Ross D Williams; Aniek F Markus; Cynthia Yang; Talita Duarte-Salles; Thomas Falconer; Jitendra Jonnagaddala; Andrew Williams; Sergio Fernández-Bertolín; Scott L DuVall; Kristin Kostka; Gowtham Rao; Azza Shoaibi; Anna Ostropolets; Matthew E Spotnitz; Lin Zhang; Paula Casajust; Ewout W Steyerberg; Fredrik Nyberg; Benjamin Skov Kaas-Hansen; Young Hwa Choi; Daniel Morales; Siaw-Teng Liaw; Maria Tereza Fernandes Abrahão; Carlos Areia; Michael E Matheny; Kristine E Lynch; María Aragón; Rae Woong Park; George Hripcsak; Christian G Reich; Marc A Suchard; Seng Chan You; Patrick B Ryan; Daniel Prieto-Alhambra; Peter R Rijnbeek

doi:10.2196/21547

JMIR Med Inform. 2021 Apr 5;9(4):e21547. doi: 10.2196/21547.

Authors

Jenna M Reps¹, Chungsoo Kim², Ross D Williams³, Aniek F Markus³, Cynthia Yang³, Talita Duarte-Salles⁴, Thomas Falconer⁵, Jitendra Jonnagaddala⁶, Andrew Williams⁷, Sergio Fernández-Bertolín⁴, Scott L DuVall⁸, Kristin Kostka⁹, Gowtham Rao¹, Azza Shoaibi¹, Anna Ostropolets⁵, Matthew E Spotnitz⁵, Lin Zhang¹⁰,¹¹, Paula Casajust¹², Ewout W Steyerberg¹³,¹⁴, Fredrik Nyberg¹⁵, Benjamin Skov Kaas-Hansen¹⁶,¹⁷, Young Hwa Choi¹⁸, Daniel Morales¹⁹, Siaw-Teng Liaw⁶, Maria Tereza Fernandes Abrahão²⁰, Carlos Areia²¹, Michael E Matheny²², Kristine E Lynch⁸, María Aragón⁴, Rae Woong Park²³, George Hripcsak⁵, Christian G Reich⁹, Marc A Suchard²⁴, Seng Chan You²³, Patrick B Ryan¹, Daniel Prieto-Alhambra²⁵, Peter R Rijnbeek³

Affiliations

¹ Janssen Research & Development, Titusville, NJ, United States.
² Department of Biomedical Sciences, Ajou University Graduate School of Medicine, Suwon, Republic of Korea.
³ Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, Netherlands.
⁴ Fundacio Institut Universitari per a la recerca a l'Atencio Primaria de Salut Jordi Gol i Gurina, Barcelona, Spain.
⁵ Department of Biomedical Informatics, Columbia University, New York, NY, United States.
⁶ School of Public Health and Community Medicine, University of New South Wales, Sydney, Australia.
⁷ Tufts Institute for Clinical Research and Health Policy Studies, Boston, MA, United States.
⁸ Department of Veterans Affairs, University of Utah, Salt Lake City, UT, United States.
⁹ Real World Solutions, IQVIA, Cambridge, MA, United States.
¹⁰ Melbourne School of Public Health, The University of Melbourne, Victoria, Australia.
¹¹ School of Public Health, Peking Union Medical College, Beijing, China.
¹² Department of Real-World Evidence, Trial Form Support, Barcelona, Spain.
¹³ Department of Public Health, Erasmus University Medical Center, Rotterdam, Netherlands.
¹⁴ Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands.
¹⁵ School of Public Health and Community Medicine, Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden.
¹⁶ Clinical Pharmacology Unit, Zealand University Hospital, Roskilde, Denmark.
¹⁷ NNF Centre for Protein Research, University of Copenhagen, Copenhagen, Denmark.
¹⁸ Department of Infectious Diseases, Ajou University School of Medicine, Suwon, Republic of Korea.
¹⁹ Division of Population Health and Genomics, University of Dundee, Dundee, United Kingdom.
²⁰ Faculty of Medicine, University of Sao Paulo, Sao Paulo, Brazil.
²¹ Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom.
²² Department of Veterans Affairs, Vanderbilt University, Nashville, TN, United States.
²³ Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Republic of Korea.
²⁴ Department of Biostatistics, UCLA Fielding School of Public Health, University of California, Los Angeles, CA, United States.
²⁵ Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, United Kingdom.

PMID: 33661754
PMCID: PMC8023380
DOI: 10.2196/21547

Abstract

Background: SARS-CoV-2 is straining health care systems globally. The burden on hospitals during the pandemic could be reduced by implementing prediction models that can discriminate patients who require hospitalization from those who do not. The COVID-19 vulnerability (C-19) index, a model that predicts which patients will be admitted to hospital for treatment of pneumonia or pneumonia proxies, has been developed and proposed as a valuable tool for decision-making during the pandemic. However, the model is at high risk of bias according to the "prediction model risk of bias assessment" criteria, and it has not been externally validated.

Objective: The aim of this study was to externally validate the C-19 index across a range of health care settings to determine how well it broadly predicts hospitalization due to pneumonia in COVID-19 cases.

Methods: We followed the Observational Health Data Sciences and Informatics (OHDSI) framework for external validation to assess the reliability of the C-19 index. We evaluated the model on two different target populations, 41,381 patients who presented with SARS-CoV-2 at an outpatient or emergency department visit and 9,429,285 patients who presented with influenza or related symptoms during an outpatient or emergency department visit, to predict their risk of hospitalization with pneumonia during the following 0-30 days. In total, we validated the model across a network of 14 databases spanning the United States, Europe, Australia, and Asia.

Results: The internal validation performance of the C-19 index had a C statistic of 0.73, and the calibration was not reported by the authors. When we externally validated it by transporting it to SARS-CoV-2 data, the model obtained C statistics of 0.36, 0.53 (0.473-0.584) and 0.56 (0.488-0.636) on Spanish, US, and South Korean data sets, respectively. The calibration was poor, with the model underestimating risk. When validated on 12 data sets containing influenza patients across the OHDSI network, the C statistics ranged between 0.40 and 0.68.
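For readers less familiar with the two measures reported above, the following is a minimal sketch of how a C statistic and a simple calibration summary can be computed. It is not the study's code: the function names `c_statistic` and `observed_to_expected` and the four-patient toy data are hypothetical and chosen only for illustration.

```python
from itertools import product


def c_statistic(y_true, y_score):
    """Share of (outcome, no-outcome) patient pairs in which the patient
    with the outcome received the higher predicted risk; ties count 0.5."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one patient with and one without the outcome")
    concordant = 0.0
    for p, n in product(pos, neg):
        if p > n:
            concordant += 1.0
        elif p == n:
            concordant += 0.5
    return concordant / (len(pos) * len(neg))


def observed_to_expected(y_true, y_score):
    """Observed number of outcomes divided by the sum of predicted risks.
    A ratio above 1 means the model underestimates risk on average."""
    return sum(y_true) / sum(y_score)


# Hypothetical toy data: 1 = hospitalized with pneumonia, 0 = not.
outcomes = [0, 0, 1, 1]
predicted_risk = [0.1, 0.4, 0.35, 0.8]

print(c_statistic(outcomes, predicted_risk))           # 3 of 4 pairs concordant
print(observed_to_expected(outcomes, predicted_risk))  # above 1: risk underestimated
```

A C statistic of 0.5 corresponds to ranking patients no better than chance, and a value below 0.5 means patients who experienced the outcome tended to receive lower predicted risks than those who did not.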

Conclusions: Our results show that the discriminative performance of the C-19 index model is low for influenza cohorts and even worse among patients with COVID-19 in the United States, Spain, and South Korea. These results suggest that C-19 should not be used to aid decision-making during the COVID-19 pandemic. Our findings highlight the importance of performing external validation across a range of settings, especially when a prediction model is being extrapolated to a different population. In the field of prediction, extensive validation is required to create appropriate trust in a model.

Keywords: C-19; COVID-19; bias; datasets; decision-making; external validation; hospitalization; modeling; observation; prediction; prognostic model; risk; transportability.

©Jenna M Reps, Chungsoo Kim, Ross D Williams, Aniek F Markus, Cynthia Yang, Talita Duarte-Salles, Thomas Falconer, Jitendra Jonnagaddala, Andrew Williams, Sergio Fernández-Bertolín, Scott L DuVall, Kristin Kostka, Gowtham Rao, Azza Shoaibi, Anna Ostropolets, Matthew E Spotnitz, Lin Zhang, Paula Casajust, Ewout W Steyerberg, Fredrik Nyberg, Benjamin Skov Kaas-Hansen, Young Hwa Choi, Daniel Morales, Siaw-Teng Liaw, Maria Tereza Fernandes Abrahão, Carlos Areia, Michael E Matheny, Kristine E Lynch, María Aragón, Rae Woong Park, George Hripcsak, Christian G Reich, Marc A Suchard, Seng Chan You, Patrick B Ryan, Daniel Prieto-Alhambra, Peter R Rijnbeek. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 05.04.2021.
