The STOIC2021 COVID-19 AI challenge: Applying reusable training methodologies to private data

Luuk H Boulogne; Julian Lorenz; Daniel Kienzle; Robin Schön; Katja Ludwig; Rainer Lienhart; Simon Jégou; Guang Li; Cong Chen; Qi Wang; Derik Shi; Mayug Maniparambil; Dominik Müller; Silvan Mertes; Niklas Schröter; Fabio Hellmann; Miriam Elia; Ine Dirks; Matías Nicolás Bossa; Abel Díaz Berenguer; Tanmoy Mukherjee; Jef Vandemeulebroucke; Hichem Sahli; Nikos Deligiannis; Panagiotis Gonidakis; Ngoc Dung Huynh; Imran Razzak; Reda Bouadjenek; Mario Verdicchio; Pasquale Borrelli; Marco Aiello; James A Meakin; Alexander Lemm; Christoph Russ; Razvan Ionasec; Nikos Paragios; Bram van Ginneken; Marie-Pierre Revel

doi:10.1016/j.media.2024.103230

The STOIC2021 COVID-19 AI challenge: Applying reusable training methodologies to private data

Med Image Anal. 2024 Oct:97:103230. doi: 10.1016/j.media.2024.103230. Epub 2024 Jun 5.

Authors

Luuk H Boulogne¹, Julian Lorenz², Daniel Kienzle³, Robin Schön³, Katja Ludwig³, Rainer Lienhart³, Simon Jégou⁴, Guang Li⁵, Cong Chen⁶, Qi Wang⁶, Derik Shi⁶, Mayug Maniparambil⁷, Dominik Müller⁸, Silvan Mertes⁹, Niklas Schröter⁹, Fabio Hellmann⁹, Miriam Elia¹⁰, Ine Dirks¹¹, Matías Nicolás Bossa¹², Abel Díaz Berenguer¹², Tanmoy Mukherjee¹², Jef Vandemeulebroucke¹², Hichem Sahli¹², Nikos Deligiannis¹², Panagiotis Gonidakis¹², Ngoc Dung Huynh¹³, Imran Razzak¹⁴, Reda Bouadjenek¹³, Mario Verdicchio¹⁵, Pasquale Borrelli¹⁶, Marco Aiello¹⁶, James A Meakin¹⁷, Alexander Lemm¹⁸, Christoph Russ¹⁸, Razvan Ionasec¹⁸, Nikos Paragios¹⁹, Bram van Ginneken¹⁷, Marie-Pierre Revel²⁰

Affiliations

¹ Radboud university medical center, P.O. Box 9101, 6500HB Nijmegen, The Netherlands. Electronic address: [email protected].
² University of Augsburg, Universitätsstraße 2, 86159 Augsburg, Germany. Electronic address: [email protected].
³ University of Augsburg, Universitätsstraße 2, 86159 Augsburg, Germany.
⁴ Independent researcher. Electronic address: [email protected].
⁵ Keya medical technology co. ltd, Floor 20, Building A, 1 Ronghua South Road, Yizhuang Economic Development Zone, Daxing District, Beijing, PR China. Electronic address: [email protected].
⁶ Keya medical technology co. ltd, Floor 20, Building A, 1 Ronghua South Road, Yizhuang Economic Development Zone, Daxing District, Beijing, PR China.
⁷ ML-Labs, Dublin City University, N210, Marconi building, Dublin City University, Glasnevin, Dublin 9, Ireland. Electronic address: [email protected].
⁸ University of Augsburg, Universitätsstraße 2, 86159 Augsburg, Germany; Faculty of Applied Computer Science, University of Augsburg, Germany.
⁹ Faculty of Applied Computer Science, University of Augsburg, Germany.
¹⁰ Faculty of Applied Computer Science, University of Augsburg, Germany. Electronic address: [email protected].
¹¹ Vrije Universiteit Brussel, Department of Electronics and Informatics, Pleinlaan 2, 1050 Brussels, Belgium; imec, Kapeldreef 75, 3001 Leuven, Belgium. Electronic address: [email protected].
¹² Vrije Universiteit Brussel, Department of Electronics and Informatics, Pleinlaan 2, 1050 Brussels, Belgium; imec, Kapeldreef 75, 3001 Leuven, Belgium.
¹³ Deakin University, Geelong, Australia.
¹⁴ University of New South Wales, Sydney, Australia. Electronic address: [email protected].
¹⁵ IRCCS SYNLAB SDN, Naples, Italy. Electronic address: [email protected].
¹⁶ IRCCS SYNLAB SDN, Naples, Italy.
¹⁷ Radboud university medical center, P.O. Box 9101, 6500HB Nijmegen, The Netherlands.
¹⁸ Amazon Web Services, Marcel-Breuer-Str. 12, 80807 München, Germany.
¹⁹ Keya medical technology co. ltd, Floor 20, Building A, 1 Ronghua South Road, Yizhuang Economic Development Zone, Daxing District, Beijing, PR China; TheraPanacea, 75004, Paris, France.
²⁰ Department of Radiology, Université de Paris, APHP, Hôpital Cochin, 27 rue du Fg Saint Jacques, 75014 Paris, France.

PMID: 38875741
DOI: 10.1016/j.media.2024.103230

Abstract

Challenges drive the state-of-the-art of automated medical image analysis. The quantity of public training data that they provide can limit the performance of their solutions. Public access to the training methodology for these solutions remains absent. This study implements the Type Three (T3) challenge format, which allows for training solutions on private data and guarantees reusable training methodologies. With T3, challenge organizers train a codebase provided by the participants on sequestered training data. T3 was implemented in the STOIC2021 challenge, with the goal of predicting from a computed tomography (CT) scan whether subjects had a severe COVID-19 infection, defined as intubation or death within one month. STOIC2021 consisted of a Qualification phase, where participants developed challenge solutions using 2000 publicly available CT scans, and a Final phase, where participants submitted their training methodologies with which solutions were trained on CT scans of 9724 subjects. The organizers successfully trained six of the eight Final phase submissions. The submitted codebases for training and running inference were released publicly. The winning solution obtained an area under the receiver operating characteristic curve for discerning between severe and non-severe COVID-19 of 0.815. The Final phase solutions of all finalists improved upon their Qualification phase solutions.

Keywords: COVID-19; Machine learning; Medical image analysis challenge.

MeSH terms

Artificial Intelligence
COVID-19*
Humans
SARS-CoV-2*
Tomography, X-Ray Computed*