Temporal multi-step predictive modeling of remission in major depressive disorder using early stage treatment data; STAR*D based machine learning approach

Haitham Salem; Tung Huynh; Natasha Topolski; Benson Mwangi; Madhukar H Trivedi; Jair C Soares; A John Rush; Sudhakar Selvaraj

doi:10.1016/j.jad.2022.12.076

Temporal multi-step predictive modeling of remission in major depressive disorder using early stage treatment data; STAR*D based machine learning approach

J Affect Disord. 2023 Mar 1:324:286-293. doi: 10.1016/j.jad.2022.12.076. Epub 2022 Dec 28.

Authors

Haitham Salem¹, Tung Huynh², Natasha Topolski², Benson Mwangi², Madhukar H Trivedi³, Jair C Soares², A John Rush⁴, Sudhakar Selvaraj⁵

Affiliations

¹ Department of Psychiatry and Human Behavior (DPHB), Warren Alpert School of Medicine, Brown University, Providence, RI, USA.
² Louis Faillace Department of Psychiatry and Behavioral Science, McGovern Medical School, University of Texas Health Science Center, Houston, TX, USA.
³ Department of Psychiatry, University of Texas Southwestern Medical Center, Dallas, TX, USA.
⁴ Department of Psychiatry and Behavioral Sciences, Duke University School of Medicine, Durham, NC, USA; Professor Emeritus, Duke-National University of Singapore, Singapore, Singapore.
⁵ Louis Faillace Department of Psychiatry and Behavioral Science, McGovern Medical School, University of Texas Health Science Center, Houston, TX, USA. Electronic address: [email protected].

Abstract

Background: Artificial intelligence is currently being used to facilitate early disease detection, better understand disease progression, optimize medication/treatment dosages, and uncover promising novel treatments and potential outcomes.

Methods: Utilizing the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) dataset, we built a machine learning model to predict depression remission rates using same clinical data as features for each of the first three antidepressant treatment steps in STAR*D. We only used early treatment data (baseline and first follow up) in each STAR*D step to temporally analyze predictive features of remission at the end of the step.

Results: Our model showed significant prediction performance across the three treatment steps, At step 1, Model accuracy was 66 %; sensitivity-65 %, specificity-67 %, positive predictive value (PPV)-65.5 %, and negative predictive value (NPV)-66.6 %. At step 2, model accuracy was 71.3 %, sensitivity-74.3 %, specificity-69 %, PPV-64.5 %, and NPV-77.9 %. At step 3, accuracy reached 84.6 %; sensitivity-69 %, specificity-88.8 %, PPV-67 %, and NPV-91.1 %. Across all three steps, the early Quick Inventory of Depressive Symptomatology-Self-Report (QIDS-SR) scores were key elements in predicting the final treatment outcome. The model also identified key sociodemographic factors that predicted treatment remission at different steps.

Limitations: The retrospective design, lack of replication in an independent dataset, and the use of "a complete case analysis" model in our analysis.

Conclusions: This proof-of-concept study showed that using early treatment data, multi-step temporal prediction of depressive symptom remission results in clinically useful accuracy rates. Whether these predictive models are generalizable deserves further study.

Trial registration: ClinicalTrials.gov NCT00021528.

Keywords: Decision trees; Depression; Machine learning; Predictive models; Remission.

Publication types

Research Support, U.S. Gov't, P.H.S.
Research Support, N.I.H., Extramural

MeSH terms

Antidepressive Agents / therapeutic use
Artificial Intelligence
Citalopram / therapeutic use
Depressive Disorder, Major* / diagnosis
Depressive Disorder, Major* / drug therapy
Humans
Machine Learning
Retrospective Studies
Treatment Outcome

Temporal multi-step predictive modeling of remission in major depressive disorder using early stage treatment data; STAR*D based machine learning approach

Authors

Affiliations

Abstract

Publication types

MeSH terms

Substances

Associated data

Grants and funding