مرکز منطقه ای اطلاع رساني علوم و فناوري - Cross-corpus depression prediction from speech

DocumentCode :

730748

Title :

Cross-corpus depression prediction from speech

Author :

Mitra, Vikramjit ; Shriberg, Elizabeth ; Vergyri, Dimitra ; Knoth, Bruce ; Salomon, Ronald M.

Author_Institution :

SRI Int., Menlo Park, CA, USA

fYear :

2015

fDate :

19-24 April 2015

Firstpage :

4769

Lastpage :

4773

Abstract :

Research on detecting depression from speech has advanced in recent years, but most work has focused on the analysis of one corpus at a time. Given that clinical corpora are typically small, it is important to explore approaches that generalize across corpora and that could ultimately be adapted to new data. We study a new corpus of patient-clinician interactions recorded when patients are admitted to a hospital for suicide risk and again when they are released. To train prediction models, we use the 2014 AVEC challenge German speech dataset, which differs from our data in many factors (including language, context, speakers, and recording conditions). Results reveal that some of the AVEC-trained models predict scores for the clinical data that correlate with both HAM-D depression scores and with the pre-/post-admission ordering. A KL-divergence analysis within the clinical data confirms that the same feature set captures changes correlated with the HAM-D scores. Finally, read versus spontaneous speech samples in both corpora behave differently with respect to the best features and modeling approaches. Implications for the cross-corpus prediction of depression are discussed.

Keywords :

audio-visual systems; emotion recognition; medical disorders; patient care; patient treatment; speech processing; text analysis; AVEC-trained model; Audio-Visual Emotion Recognition Challenge; HAM-D depression score; Hamilton depression rating scale; KL-divergence analysis; clinical corpora; cross-corpus depression prediction; feature set; patient-clinician interaction; read speech sample; spontaneous speech sample; train prediction model; Cepstral analysis; Correlation; Feature extraction; Interviews; Robustness; Speech; AVEC Challenge; acoustic features; articulatory features; cross-corpus modeling; depression detection; mental health; phonetic features; prosodic features;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location :

South Brisbane, QLD

Type :

conf

DOI :

10.1109/ICASSP.2015.7178876

Filename :

7178876

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=730748