Validation of depression determinants in caregivers of dementia patients with machine learning algorithms and statistical model

Front Med (Lausanne). 2023 Feb 2:10:1095385. doi: 10.3389/fmed.2023.1095385. eCollection 2023.

Abstract

Introduction: Due to its increasing prevalence, dementia is currently one of the most extensively studied health issues. Although it represents a comparatively less-addressed issue, the caregiving burden for dementia patients is likewise receiving attention.

Methods: To identify determinants of depression in dementia caregivers, using Community Health Survey (CHS) data collected by the Korea Disease Control and Prevention Agency (KDCA). By setting "dementia caregiver's status of residence with patient" as a standard variable, we selected corresponding CHS data from 2011 to 2019. After refining the data, we split dementia caregiver and general population groups among the dataset (n = 15,708; common variables = 34). We then applied three machine learning algorithms: Extreme Gradient Boosting (XGBoost), Logistic Regression (LR), and Support Vector Classifier (SVC). Subsequently, we selected XGBoost, as it exhibited superior performance to the other algorithms. On the feature importance of XGBoost, we performed a multivariate hierarchical regression analysis to validate the depression causes experienced in each group. We validated the results of the statistical model analysis by performing Welch's t-test on the main determinants exhibited within each group.

Results: By verifying the results from machine learning via statistical model analysis, we found "sex" to highly impact depression in dementia caregivers, whereas "status of economic activities" is significantly associated with depression in the general population.

Discussion: The evident difference in causes of depression between the two groups may serve as a basis for policy development to improve the mental health of dementia caregivers.

Keywords: caregiver; community health survey; dementia; depression; machine learning; statistical model.

Grants and funding

This research was supported by the Yonsei Signature Research Cluster Program of 2021 (2021-22-0005). This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2019R1A2C1007399).