Artificial intelligence (AI) has the potential to bring transformative improvements to the field of radiology; yet, there are barriers to widespread clinical adoption. One of the most important barriers has been access to large, well-annotated, widely representative medical image datasets, which can be used to accurately train AI programs. Creating such datasets requires time and expertise and runs into constraints around data security and interoperability, patient privacy, and appropriate data use. Recognizing these challenges, several institutions have started curating and providing publicly available, high-quality datasets that can be accessed by researchers to advance AI models. The purpose of this work was to review the publicly available MRI datasets that can be used for AI research in radiology. Despite being an emerging field, a simple internet search for open MRI datasets presents an overwhelming number of results. Therefore, we decided to create a survey of the major publicly accessible MRI datasets in different subfields of radiology (brain, body, and musculoskeletal), and list the most important features of value to the AI researcher. To complete this review, we searched for publicly available MRI datasets and assessed them based on several parameters (number of subjects, demographics, area of interest, technical features, and annotations). We reviewed 110 datasets across sub-fields with 1,686,245 subjects in 12 different areas of interest ranging from spine to cardiac. This review is meant to serve as a reference for researchers to help spur advancements in the field of AI for radiology. LEVEL OF EVIDENCE: Level 4 TECHNICAL EFFICACY: Stage 6.
Keywords: artificial intelligence; datasets; open-access; public; radiology.
© 2023 International Society for Magnetic Resonance in Medicine.