RSNA 2023 Abdominal Trauma AI Challenge Review and Outcomes Analysis

Sebastiaan Hermans; Zixuan Hu; Robyn L Ball; Hui Ming Lin; Luciano M Prevedello; Ferco H Berger; Ibrahim Yusuf; Jeffrey D Rudie; Maryam Vazirabad; Adam E Flanders; George Shih; John Mongan; Savvas Nicolaou; Brett S Marinelli; Melissa A Davis; Kirti Magudia; Ervin Sejdić; Errol Colak

doi:10.1148/ryai.240334

RSNA 2023 Abdominal Trauma AI Challenge Review and Outcomes Analysis

Radiol Artif Intell. 2024 Nov 6:e240334. doi: 10.1148/ryai.240334. Online ahead of print.

Authors

Sebastiaan Hermans^#¹, Zixuan Hu^#¹, Robyn L Ball¹, Hui Ming Lin¹, Luciano M Prevedello¹, Ferco H Berger¹, Ibrahim Yusuf¹, Jeffrey D Rudie¹, Maryam Vazirabad¹, Adam E Flanders¹, George Shih¹, John Mongan¹, Savvas Nicolaou¹, Brett S Marinelli¹, Melissa A Davis¹, Kirti Magudia¹, Ervin Sejdić¹, Errol Colak¹

Affiliation

¹ From the Department of Medical Imaging, St. Michael's Hospital, Unity Health Toronto, 30 Bond St, Toronto, ON, Canada, M5B 1W8 (S.H., Z.H., H.M.L., I.Y., E.C.); Edward S. Rogers Department of Electrical and Computer Engineering, University of Toronto, Toronto, Ontario, Canada (Z.H., E.S.); The Jackson Laboratory, Bar Harbor, Me (R.L.B.); Department of Radiology, The Ohio State University, Columbus, Ohio (L.M.P.); Department of Medical Imaging, Sunnybrook Health Sciences Centre, University of Toronto, Toronto, Ontario, Canada (F.H.B.); Department of Radiology, Scripps Clinic Medical Group and University of California San Diego, San Diego, Calif (J.D.R.); Radiological Society of North America, Oak Brook, Ill (M.V.); Department of Radiology, Thomas Jefferson University, Philadelphia, Pa (A.E.F.); Department of Radiology, Weill Cornell Medicine, New York, NY (G.S.); Department of Radiology and Biomedical Imaging, University of California San Francisco, San Francisco, Calif (J.M.), Department of Radiology, Vancouver General Hospital, Vancouver, Canada (S.N.); Department of Radiology, Memorial Sloan-Kettering Cancer Center, New York, NY (B.S.M.); Department of Radiology and Biomedical Imaging, Yale University School of Medicine, New Haven, Conn (M.A.D.); Duke University School of Medicine, Durham, NC (K.M.); North York General Hospital, Toronto, Ontario, Canada (E.S.); and Department of Medical Imaging, University of Toronto, Toronto, Ontario, Canada (E.C.).

^# Contributed equally.

PMID: 39503604
DOI: 10.1148/ryai.240334

Abstract

"Just Accepted" papers have undergone full peer review and have been accepted for publication in Radiology: Artificial Intelligence. This article will undergo copyediting, layout, and proof review before it is published in its final version. Please note that during production of the final copyedited article, errors may be discovered which could affect the content. Purpose To evaluate the performance of the winning machine learning (ML) models from the 2023 RSNA Abdominal Trauma Detection Artificial Intelligence Challenge. Materials and Methods The competition was hosted on Kaggle and took place between July 26, 2023, to October 15, 2023. The multicenter competition dataset consisted of 4,274 abdominal trauma CT scans in which solid organs (liver, spleen and kidneys) were annotated as healthy, low-grade or high-grade injury. Studies were labeled as positive or negative for the presence of bowel/mesenteric injury and active extravasation. In this study, performances of the 8 award-winning models were retrospectively assessed and compared using various metrics, including the area under the receiver operating characteristic curve (AUC), for each injury category. The reported mean values of these metrics were calculated by averaging the performance across all models for each specified injury type. Results The models exhibited strong performance in detecting solid organ injuries, particularly high-grade injuries. For binary detection of injuries, the models demonstrated mean AUC values of 0.92 (range:0.91-0.94) for liver, 0.91 (range:0.87-0.93) for splenic, and 0.94 (range:0.93-0.95) for kidney injuries. The models achieved mean AUC values of 0.98 (range:0.96-0.98) for high-grade liver, 0.98 (range:0.97-0.99) for high-grade splenic, and 0.98 (range:0.97-0.98) for high-grade kidney injuries. For the detection of bowel/mesenteric injuries and active extravasation, the models demonstrated mean AUC values of 0.85 (range:0.74-0.73) and 0.85 (range:0.79-0.89) respectively. Conclusion The award-winning models from the AI challenge demonstrated strong performance in the detection of traumatic abdominal injuries on CT scans, particularly high-grade injuries. These models may serve as a performance baseline for future investigations and algorithms. ©RSNA, 2024.