Development and validation of a federated learning framework for detection of subphenotypes of multisystem inflammatory syndrome in children

Naimin Jing; Xiaokang Liu; Qiong Wu; Suchitra Rao; Asuncion Mejias; Mitchell Maltenfort; Julia Schuchard; Vitaly Lorman; Hanieh Razzaghi; Ryan Webb; Chuan Zhou; Ravi Jhaveri; Grace M Lee; Nathan M Pajor; Deepika Thacker; L Charles Bailey; Christopher B Forrest; Yong Chen

doi:10.1101/2024.01.26.24301827

Development and validation of a federated learning framework for detection of subphenotypes of multisystem inflammatory syndrome in children

medRxiv [Preprint]. 2024 Jan 27:2024.01.26.24301827. doi: 10.1101/2024.01.26.24301827.

Authors

Naimin Jing^{1

2}, Xiaokang Liu¹, Qiong Wu¹, Suchitra Rao³, Asuncion Mejias⁴, Mitchell Maltenfort⁵, Julia Schuchard⁵, Vitaly Lorman⁵, Hanieh Razzaghi⁵, Ryan Webb⁵, Chuan Zhou⁶, Ravi Jhaveri⁷, Grace M Lee⁸, Nathan M Pajor⁹, Deepika Thacker¹⁰, L Charles Bailey^{5

11}, Christopher B Forrest^{5

11}, Yong Chen¹

Affiliations

¹ Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA.
² Current affiliation: Biostatistics and Research Decision Sciences, Merck & Co., Inc, Kenilworth, NJ.
³ Department of Pediatrics, University of Colorado School of Medicine and Children's Hospital Colorado, Aurora, CO.
⁴ Division of Infectious Diseases, Department of Pediatrics, Nationwide Children's Hospital and The Ohio State University, Columbus, OH.
⁵ Applied Clinical Research Center, Children's Hospital of Philadelphia, Philadelphia, PA.
⁶ Center for Child Health, Behavior and Development, Seattle Children's Hospital, Seattle, WA.
⁷ Division of Infectious Diseases, Ann & Robert H. Lurie Children's Hospital of Chicago, Chicago, IL.
⁸ Department of Pediatrics (Infectious Diseases), Stanford University School of Medicine, Stanford, CA.
⁹ Division of Pulmonary Medicine, Cincinnati Children's Hospital Medical Center and University of Cincinnati College of Medicine, Cincinnati, OH.
¹⁰ Division of Cardiology, Nemours Children's Health, Wilmington, DE.
¹¹ Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA.

Abstract

Background: Multisystem inflammatory syndrome in children (MIS-C) is a severe post-acute sequela of SARS-CoV-2 infection. The highly diverse clinical features of MIS-C necessities characterizing its features by subphenotypes for improved recognition and treatment. However, jointly identifying subphenotypes in multi-site settings can be challenging. We propose a distributed multi-site latent class analysis (dMLCA) approach to jointly learn MIS-C subphenotypes using data across multiple institutions.

Methods: We used data from the electronic health records (EHR) systems across nine U.S. children's hospitals. Among the 3,549,894 patients, we extracted 864 patients < 21 years of age who had received a diagnosis of MIS-C during an inpatient stay or up to one day before admission. Using MIS-C conditions, laboratory results, and procedure information as input features for the patients, we applied our dMLCA algorithm and identified three MIS-C subphenotypes. As validation, we characterized and compared more granular features across subphenotypes. To evaluate the specificity of the identified subphenotypes, we further compared them with the general subphenotypes identified in the COVID-19 infected patients.

Findings: Subphenotype 1 (46.1%) represents patients with a mild manifestation of MIS-C not requiring intensive care, with minimal cardiac involvement. Subphenotype 2 (25.3%) is associated with a high risk of shock, cardiac and renal involvement, and an intermediate risk of respiratory symptoms. Subphenotype 3 (28.6%) represents patients requiring intensive care, with a high risk of shock and cardiac involvement, accompanied by a high risk of >4 organ system being impacted. Importantly, for hospital-specific clinical decision-making, our algorithm also revealed a substantial heterogeneity in relative proportions of these three subtypes across hospitals. Properly accounting for such heterogeneity can lead to accurate characterization of the subphenotypes at the patient-level.

Interpretation: Our identified three MIS-C subphenotypes have profound implications for personalized treatment strategies, potentially influencing clinical outcomes. Further, the proposed algorithm facilitates federated subphenotyping while accounting for the heterogeneity across hospitals.

Publication types

Preprint

Grants and funding

OT2 HL161847/HL/NHLBI NIH HHS/United States