Development of an individualized dementia risk prediction model using deep learning survival analysis incorporating genetic and environmental factors

Shiqi Yuan; Qing Liu; Xiaxuan Huang; Shanyuan Tan; Zihong Bai; Juan Yu; Fazhen Lei; Huan Le; Qingqing Ye; Xiaoxue Peng; Juying Yang; Yitong Ling; Jun Lyu

doi:10.1186/s13195-024-01663-w

Development of an individualized dementia risk prediction model using deep learning survival analysis incorporating genetic and environmental factors

Alzheimers Res Ther. 2024 Dec 30;16(1):278. doi: 10.1186/s13195-024-01663-w.

Authors

Shiqi Yuan^#^{1

2}, Qing Liu^#², Xiaxuan Huang¹, Shanyuan Tan¹, Zihong Bai¹, Juan Yu², Fazhen Lei², Huan Le², Qingqing Ye², Xiaoxue Peng², Juying Yang², Yitong Ling³, Jun Lyu^{4

5}

Affiliations

¹ Department of Neurology, The First Affiliated Hospital of Jinan University, No.613, Huangpu Road West, Guangzhou, Guangdong Province, 510630, China.
² Department of Neurology, The Second People's Hospital of Guiyang (The Affiliated Jinyang Hospital of Guizhou Medical University), Guiyang, Guizhou Province, 550000, China.
³ Department of Neurology, The First Affiliated Hospital of Jinan University, No.613, Huangpu Road West, Guangzhou, Guangdong Province, 510630, China. [email protected].
⁴ Department of Clinical Research, The First Affiliated Hospital of Jinan University, No.613, Huangpu Road West, Guangzhou, Guangdong Province, 510630, China. [email protected].
⁵ Guangdong Provincial Key Laboratory of Traditional Chinese Medicine Informatization, Guangzhou, Guangdong, 510630, China. [email protected].

^# Contributed equally.

Abstract

Background: Dementia is a major public health challenge in modern society. Early detection of high-risk dementia patients and timely intervention or treatment are of significant clinical importance. Neural network survival analysis represents the most advanced technology for survival analysis to date. However, there is a lack of deep learning-based survival analysis models that integrate both genetic and clinical factors to develop and validate individualized dynamic dementia risk prediction models.

Methods and results: This study is based on a large prospective cohort from the UK Biobank, which includes a total of 41,484 participants with an average follow-up period of 12.6 years. Initially, 364 candidate features (predictor variables) were screened. The top 30 key features were then identified by ranking the importance of each predictor variable using the Gradient Boosting Machine (GBM) model. A multi-model comparison strategy was employed to evaluate the predictive performance of four survival analysis models: DeepSurv, DeepHit, Kaplan-Meier estimation, and the Cox proportional hazards model (CoxPH). The results showed that the average Harrell's C-index for the DeepSurv model was 0.743, for the DeepHit model it was 0.633, for the CoxPH model it was 0.749, and for the Kaplan-Meier estimator model it was 0.500. In addition, the average D-Calibration Survival Measure was 6.014, 4408.086, 32274.743, and 1.508, respectively. The Brier score (BS) was used to assess the importance of features for the DeepSurv dementia prediction model, and the relationship between features and dementia was visualized using a partial dependence plot (PDP). To facilitate further research, the team deployed the DeepSurv dementia prediction model on AliCloud servers and designated it as the UKB-DementiaPre Tool.

Conclusion: This study successfully developed and validated the DeepSurv dementia prediction model for individuals aged 60 years and above, integrating both genetic and clinical data. The model was then deployed on AliCloud servers to promote its clinical translation. It is anticipated that this prediction model will provide more accurate decision support for clinical treatment and will serve as a valuable tool for the primary prevention of dementia.

Keywords: DeepSurv; Dementia; Risk prediction model; Survival analysis.

MeSH terms

Aged
Deep Learning*
Dementia* / epidemiology
Dementia* / genetics
Female
Humans
Male
Middle Aged
Proportional Hazards Models
Prospective Studies
Risk Assessment / methods
Risk Factors
Survival Analysis
United Kingdom / epidemiology

Abstract

MeSH terms

Grants and funding