Using speech recognition technology to investigate the association between timing-related speech features and depression severity

PLoS One. 2020 Sep 11;15(9):e0238726. doi: 10.1371/journal.pone.0238726. eCollection 2020.

Abstract

Background: There are no reliable and validated objective biomarkers for the assessment of depression severity. We aimed to investigate the association between depression severity and timing-related speech features using speech recognition technology.

Method: Patients with major depressive disorder (MDD), those with bipolar disorder (BP), and healthy controls (HC) were asked to engage in a non-structured interview with research psychologists. Using automated speech recognition technology, we measured three timing-related speech features: speech rate, pause time, and response time. The severity of depression was assessed using the Hamilton Depression Rating Scale 17-item version (HAMD-17). We conducted the current study to answer the following questions: 1) Are there differences in speech features among MDD, BP, and HC? 2) Do speech features correlate with depression severity? 3) Do changes in speech features correlate with within-subject changes in depression severity?

Results: We collected 1058 data sets from 241 individuals for the study (97 MDD, 68 BP, and 76 HC). There were significant differences in speech features among groups; depressed patients showed slower speech rate, longer pause time, and longer response time than HC. All timing-related speech features showed significant associations with HAMD-17 total scores. Longitudinal changes in speech rate correlated with changes in HAMD-17 total scores.

Conclusions: Depressed individuals showed longer response time, longer pause time, and slower speech rate than healthy individuals, all of which were suggestive of psychomotor retardation. Our study suggests that speech features could be used as objective biomarkers for the assessment of depression severity.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence
  • Bipolar Disorder / physiopathology*
  • Case-Control Studies
  • Depressive Disorder, Major / physiopathology*
  • Female
  • Humans
  • Male
  • Middle Aged
  • Speech*
  • Time Factors

Grants and funding

This research was supported by the Japan Agency for Medical Research and Development (AMED) under Grant Number JP19he11020004. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.