Healthy core: Harmonizing brain MRI for supporting multicenter migraine classification studies

Hyunsoo Yoon; Todd J Schwedt; Catherine D Chong; Oyekanmi Olatunde; Teresa Wu

doi:10.1371/journal.pone.0288300

Healthy core: Harmonizing brain MRI for supporting multicenter migraine classification studies

PLoS One. 2024 Dec 31;19(12):e0288300. doi: 10.1371/journal.pone.0288300. eCollection 2024.

Authors

Hyunsoo Yoon¹, Todd J Schwedt^{2

3}, Catherine D Chong^{2

3}, Oyekanmi Olatunde⁴, Teresa Wu^{3

5}

Affiliations

¹ Department of Industrial Engineering, Yonsei University, Seoul, Republic of Korea.
² Department of Neurology, Mayo Clinic, Scottsdale, Arizona, United States of America.
³ ASU-Mayo Center for Innovative Imaging, Tempe, Arizona, United States of America.
⁴ Department of Systems Science and Industrial Engineering, Binghamton University, Binghamton, New York, United States of America.
⁵ School of Computing and Augmented Intelligence, Arizona State University, Tempe, Arizona, United States of America.

Abstract

Multicenter and multi-scanner imaging studies may be necessary to ensure sufficiently large sample sizes for developing accurate predictive models. However, multicenter studies, incorporating varying research participant characteristics, MRI scanners, and imaging acquisition protocols, may introduce confounding factors, potentially hindering the creation of generalizable machine learning models. Models developed using one dataset may not readily apply to another, emphasizing the importance of classification model generalizability in multi-scanner and multicenter studies for producing reproducible results. This study focuses on enhancing generalizability in classifying individual migraine patients and healthy controls using brain MRI data through a data harmonization strategy. We propose identifying a 'healthy core'-a group of homogeneous healthy controls with similar characteristics-from multicenter studies. The Maximum Mean Discrepancy (MMD) in Geodesic Flow Kernel (GFK) space is employed to compare two datasets, capturing data variabilities and facilitating the identification of this 'healthy core'. Homogeneous healthy controls play a vital role in mitigating unwanted heterogeneity, enabling the development of highly accurate classification models with improved performance on new datasets. Extensive experimental results underscore the benefits of leveraging a 'healthy core'. We utilized two datasets: one comprising 120 individuals (66 with migraine and 54 healthy controls), and another comprising 76 individuals (34 with migraine and 42 healthy controls). Notably, a homogeneous dataset derived from a cohort of healthy controls yielded a significant 25% accuracy improvement for both episodic and chronic migraineurs.

Copyright: © 2024 Yoon et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Publication types

Multicenter Study

MeSH terms

Adult
Brain* / diagnostic imaging
Brain* / pathology
Case-Control Studies
Female
Humans
Machine Learning
Magnetic Resonance Imaging* / methods
Male
Middle Aged
Migraine Disorders* / classification
Migraine Disorders* / diagnostic imaging
Multicenter Studies as Topic

Grants and funding

YES, This work was supported by the United States Department of Defense W81XWH-15-1-0286, Schwedt, Todd, (https://cdmrp.health.mil/search.aspx?LOG_NO=PR140037) National Institutes of Health K23NS070891, Schwedt, Todd, (https://report.nih.gov/reportweb/web/displayreport?rid=568&ver=5) and internal funds from the Mayo Clinic (Schwedt, Todd). There was no additional external funding received for this study.