Background: Mobile devices for remote monitoring are indispensable tools for supporting treatment and patient care, especially in recurrent diseases such as major depressive disorder. The aim of this study was to determine whether machine learning (ML) models based on longitudinal speech data are helpful in predicting momentary depression severity. Data analyses were based on a dataset of 30 inpatients experiencing an acute depressive episode who received sleep deprivation therapy during inpatient care, an intervention that induces a rapid change in depressive symptoms within a relatively short period of time. Using an ambulatory assessment approach, we captured speech samples and assessed concomitant depression severity via self-report questionnaires over the course of 3 weeks (before, during, and after therapy). From the speech samples, we extracted 89 speech features: the Extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS), computed with the open-source Speech and Music Interpretation by Large-space Extraction (openSMILE; audEERING GmbH) toolkit, plus speech rate as an additional parameter.
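For illustration, the eGeMAPS functionals can be computed per recording with the openSMILE Python bindings; this is a minimal sketch, not the study's actual extraction pipeline, and the file name below is a placeholder.

```python
# Minimal sketch: extracting the 88 eGeMAPS functionals from one speech sample
# with the opensmile Python package (audEERING); "speech_sample.wav" is a
# placeholder file name, not a recording from the study.
import opensmile

smile = opensmile.Smile(
    feature_set=opensmile.FeatureSet.eGeMAPSv02,        # Extended Geneva Minimalistic Acoustic Parameter Set
    feature_level=opensmile.FeatureLevel.Functionals,   # one feature vector per recording
)

features = smile.process_file("speech_sample.wav")      # pandas DataFrame with 88 columns
# Speech rate (the 89th feature in the study) is not part of eGeMAPS and would
# be derived separately, for example from a syllable or word count per second.
```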
Objective: We aimed to understand whether a multiparameter ML approach would significantly improve prediction compared with our previous statistical analyses and, in addition, which strategy for splitting training and test data was most successful, with a particular focus on the idea of personalized prediction.
Methods: To do so, we trained and evaluated a set of >500 ML pipelines, including random forest, linear regression, support vector regression, and Extreme Gradient Boosting regression models, under 5 different train-test split scenarios: a group 5-fold nested cross-validation at the subject level, a leave-one-subject-out approach, a chronological split, an odd-even split, and a random split.
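As a minimal sketch (not the study's actual pipeline), the 5 split scenarios could be set up with scikit-learn as follows; the synthetic data, group labels, and split proportions are assumptions for illustration, and the inner hyperparameter-tuning loop of the nested cross-validation is omitted.

```python
# Illustrative setup of the 5 train-test split scenarios with synthetic data.
import numpy as np
from sklearn.model_selection import GroupKFold, LeaveOneGroupOut, train_test_split

rng = np.random.default_rng(0)
n_obs, n_features = 300, 89                      # 89 speech features per observation
X = rng.normal(size=(n_obs, n_features))
y = rng.normal(size=n_obs)                       # momentary depression severity (synthetic)
subjects = rng.integers(0, 30, size=n_obs)       # 30 inpatients
idx = np.arange(n_obs)                           # observations assumed in chronological order

# 1) Group 5-fold cross-validation at the subject level (inner tuning loop omitted)
group_cv = list(GroupKFold(n_splits=5).split(X, y, groups=subjects))

# 2) Leave-one-subject-out: each subject's observations form one test fold
loso_cv = list(LeaveOneGroupOut().split(X, y, groups=subjects))

# 3) Chronological split: earlier observations train, later observations test
cut = int(0.8 * n_obs)                           # 80/20 proportion is an assumption
chrono_train, chrono_test = idx[:cut], idx[cut:]

# 4) Odd-even split: alternating observations go to training and test sets
odd_even_train, odd_even_test = idx[idx % 2 == 0], idx[idx % 2 == 1]

# 5) Random split of observations, ignoring subject and time structure
rand_train, rand_test = train_test_split(idx, test_size=0.2, random_state=0)
```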
Results: In the 5-fold cross-validation, leave-one-subject-out, and chronological split approaches, none of the models performed statistically better than chance. The other 2 approaches produced significant results for at least one of the models tested, with similar performance. Overall, the best model was an Extreme Gradient Boosting regressor under the odd-even split approach (R²=0.339; mean absolute error=0.38; both P<.001), indicating that 33.9% of the variance in depression severity could be predicted from the speech features.
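The reported metrics are the standard coefficient of determination and mean absolute error; the following self-contained sketch shows how an Extreme Gradient Boosting regressor could be evaluated on an odd-even split, using synthetic data and illustrative hyperparameters rather than the study's configuration.

```python
# Illustrative odd-even split evaluation with XGBoost; data and hyperparameters
# are synthetic placeholders, so the printed scores will not match the paper.
import numpy as np
from sklearn.metrics import mean_absolute_error, r2_score
from xgboost import XGBRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 89))                        # 89 speech features (synthetic)
y = 0.5 * X[:, 0] + rng.normal(scale=1.0, size=300)   # synthetic severity scores

idx = np.arange(len(y))
train, test = idx[idx % 2 == 0], idx[idx % 2 == 1]    # odd-even split

model = XGBRegressor(n_estimators=200, max_depth=3, random_state=0)
model.fit(X[train], y[train])
pred = model.predict(X[test])

print(f"R^2 = {r2_score(y[test], pred):.3f}")         # share of explained variance
print(f"MAE = {mean_absolute_error(y[test], pred):.3f}")
```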
Conclusions: Overall, our analyses highlight that the ML models failed to predict depression scores of unseen patients, but prediction performance increased strongly compared with our previous analyses using multilevel models. We conclude that future personalized ML models might improve prediction performance even further, leading to better patient management and care.
Keywords: ambulatory assessment; depression; depressive disorder; digital health; mHealth; machine learning; mental health; mobile health; mobile phone; openSMILE; remote monitoring; sleep deprivation therapy; speech features.
© Lisa-Marie Hartnagel, Daniel Emden, Jerome C Foo, Fabian Streit, Stephanie H Witt, Josef Frank, Matthias F Limberger, Sara E Schmitz, Maria Gilles, Marcella Rietschel, Tim Hahn, Ulrich W Ebner-Priemer, Lea Sirignano. Originally published in JMIR Mental Health (https://mental.jmir.org).