Cross-dataset Evaluation of Dementia Longitudinal Progression Prediction Models

Chen Zhang; Lijun An; Naren Wulan; Kim-Ngan Nguyen; Csaba Orban; Pansheng Chen; Christopher Chen; Juan Helen Zhou; Keli Liu; B T Thomas Yeo; Alzheimer’s Disease Neuroimaging Initiative; Australian Imaging Biomarkers and Lifestyle Study of Aging

doi:10.1101/2024.11.18.24317513

medRxiv [Preprint]. 2024 Nov 19:2024.11.18.24317513. doi: 10.1101/2024.11.18.24317513.

Authors

Chen Zhang^{1,2,3}, Lijun An^{1,2,3}, Naren Wulan^{1,2,3}, Kim-Ngan Nguyen¹, Csaba Orban^{1,2,3}, Pansheng Chen^{1,2,3}, Christopher Chen⁴, Juan Helen Zhou^{1,2,5,6}, Keli Liu⁷, B T Thomas Yeo^{1,2,5,3,6,8}; Alzheimer’s Disease Neuroimaging Initiative; Australian Imaging Biomarkers and Lifestyle Study of Aging

Affiliations

¹ Centre for Sleep and Cognition (CSC) & Centre for Translational Magnetic Resonance Research (TMR), Yong Loo Lin School of Medicine, National University of Singapore, Singapore.
² Department of Electrical and Computer Engineering, National University of Singapore, Singapore.
³ N.1 Institute for Health, National University of Singapore, Singapore.
⁴ Memory Aging and Cognition Centre, Department of Pharmacology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.
⁵ Department of Medicine, Healthy Longevity Translational Research Programme, Human Potential Translational Research Programme & Institute for Digital Medicine (WisDM), Yong Loo Lin School of Medicine, National University of Singapore, Singapore.
⁶ Integrative Sciences and Engineering Programme (ISEP), National University of Singapore.
⁷ Company A, Berkeley, CA, USA.
⁸ Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, MA, USA.

Abstract

Accurate Alzheimer's Disease (AD) progression prediction is essential for early intervention. The TADPOLE challenge, involving 92 algorithms, used multimodal biomarkers to predict future clinical diagnosis, cognition, and ventricular volume. The winning algorithm, FROG, utilized a Longitudinal-to-Cross-sectional (L2C) transformation to convert variable longitudinal histories into fixed-length feature vectors, which contrasted with most existing approaches that fitted models to entire longitudinal histories, e.g., AD Course Map (AD-Map) and minimal recurrent neural networks (MinimalRNN). The TADPOLE challenge only utilized the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset. To evaluate FROG's generalizability, we trained it on the ADNI dataset and tested it on three external datasets covering 2,312 participants and 13,200 timepoints. We also introduced two FROG variants. One variant, L2C feedforward neural network (L2C-FNN), unified all XGBoost models used by the original FROG with an FNN. Across external datasets, L2C-FNN and AD-Map were the best for predicting cognition and ventricular volume. For clinical diagnosis prediction, L2C-FNN was the best, while AD-Map was the worst. L2C-FNN compared favorably with other approaches regardless of the number of observed timepoints, and when predicting from 0 to 6 years into the future, underscoring its potential for long-term dementia progression prediction. Pretrained ADNI models are publicly available: GITHUB_LINK.
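
To make the L2C idea concrete, the following is a minimal sketch of how a variable-length history of one biomarker might be collapsed into a fixed-length feature vector. The function name `l2c_features` and the particular summaries it computes (latest value, range, mean, crude slope, forecast horizon, visit count) are illustrative assumptions and are not taken from FROG, whose actual feature set is not described in this abstract.

```python
import numpy as np


def l2c_features(times, values, t_pred):
    """Map a variable-length history of one biomarker to a fixed-length vector.

    times  : visit times in years, in ascending order
    values : measurements taken at those visits
    t_pred : time (in years) at which a prediction is wanted

    The summaries below are hypothetical choices for illustration only;
    they are not FROG's actual features.
    """
    t = np.asarray(times, dtype=float)
    v = np.asarray(values, dtype=float)
    if t.size == 0 or t.size != v.size:
        raise ValueError("times and values must be non-empty and equal in length")

    # Crude rate of change between first and last visit; 0 if only one visit.
    if t.size > 1 and t[-1] > t[0]:
        slope = (v[-1] - v[0]) / (t[-1] - t[0])
    else:
        slope = 0.0

    return np.array([
        v[-1],            # most recent measurement
        v.min(),          # lowest value observed so far
        v.max(),          # highest value observed so far
        v.mean(),         # average over the history
        slope,            # crude rate of change per year
        t_pred - t[-1],   # how far into the future we are predicting
        float(t.size),    # number of observed timepoints
    ])


# Two participants with different numbers of visits yield vectors of equal length.
three_visits = l2c_features([0.0, 0.5, 1.0], [28, 27, 25], t_pred=2.0)
one_visit = l2c_features([0.0], [30], t_pred=1.0)
```

Because every participant ends up with a vector of the same length regardless of how many visits they had, any standard cross-sectional learner (for example gradient-boosted trees or a feedforward network, as in the abstract) can be trained on the result.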

Keywords: Alzheimer’s disease; XGBoost; domain generalization; feature engineering; longitudinal progression modelling; recurrent neural networks.

Publication types

Preprint
