Background: Parkinson's disease (PD), the second most common neurodegenerative disease in the world, is usually not diagnosed until the later stages of the disease, when patients might have already missed the best treatment period. Therefore, more effective prediction methods based on artificial intelligence (AI) are needed to assist physicians in timely diagnosis.
New methods: An explainable deep learning-based early Parkinson's disease diagnostic model, Parkinson's Integrative Diagnostic Gated Network (PIDGN), was designed by fusing Single Nucleotide Polymorphism (SNP) and brain sMRI data. Firstly, unimodal internal information was extracted using EmsembleTree dimensionality reduction method, Transformer encoder and 3D ResNet. Secondly, gated attention fusion technique was utilized to explore the inter-modal interactions. Finally, the classification results were output through the fully connected layer. SHapley additive interpretation (SHAP) values and Gradient-weighted Class Activation Mapping (GradCAM) techniques were used to help explain the importance of SNPs and brain regions for PD.
Results: The results showed that the PIDGN model achieved the best results with the accuracy of 0.858 and AUROC of 0.897. Top 20 SNPs and the brain regions near the midbrain potentially related to PD were identified using two explainable techniques via SHAP values and GradCAM respectively.
Comparison with existing methods and conclusion: The PIDGN model trained by fusing genetic and imaging data outperforms 13 other commonly used unimodal or bimodal models. Explainable PIDGN model helps deepen understanding of several SNPs and sMRI key factors that may affect PD. This study provides a potentially effective solution for automated early diagnosis of PD using AI.
Keywords: Explainable deep learning; Imaging genomics; Multimodal learning; Parkinson's disease; Transformer.
Copyright © 2025 Elsevier B.V. All rights reserved.