DocumentCode :
2789767
Title :
Influence of acoustic low-level descriptors in the detection of clinical depression in adolescents
Author :
Low, Lu-Shih Alex ; Maddage, Namunu C. ; Lech, Margaret ; Sheeber, Lisa ; Allen, Nicholas
Author_Institution :
Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5154
Lastpage :
5157
Abstract :
In this paper, we report the influence that classification accuracies have in speech analysis from a clinical dataset by adding acoustic low-level descriptors (LLD) belonging to prosodic (i.e. pitch, formants, energy, jitter, shimmer) and spectral features (i.e. spectral flux, centroid, entropy and roll-off) along with their delta (Δ) and delta-delta (Δ-Δ) coefficients to two baseline features of Mel frequency cepstral coefficients and Teager energy critical-band based autocorrelation envelope. Extracted acoustic low-level descriptors (LLD) that display an increase in accuracy after being added to these baseline features were finally modeled together using Gaussian mixture models and tested. A clinical data set of speech from 139 adolescents, including 68 (49 girls and 19 boys) diagnosed as clinically depressed, was used in the classification experiments. For male subjects, the combination of (TEO-CB-Auto-Env + Δ + Δ-Δ) + F0 + (LogE + Δ + Δ-Δ) + (Shimmer + Δ) + Spectral Flux + Spectral Roll-off gave the highest classification rate of 77.82% while for the female subjects, using TEO-CB-Auto-Env gave an accuracy of 74.74%.
Keywords :
Gaussian distribution; acoustic signal processing; cepstral analysis; diseases; medical signal processing; paediatrics; psychology; signal classification; speech processing; Gaussian mixture models; Mel frequency cepstral coefficients; Teager energy critical-band based autocorrelation envelope; acoustic low-level descriptors; adolescents; clinical depression; delta coefficients; delta-delta coefficients; energy; formants; jitter; pitch; prosodic features; shimmer; spectral centroid; spectral entropy; spectral features; spectral flux; spectral roll-off; speech analysis; Acoustic signal detection; Autocorrelation; Cepstral analysis; Image analysis; Jitter; Mel frequency cepstral coefficient; Pattern analysis; Psychology; Speech analysis; Speech processing; Clinical depression; Gaussian Mixture Model; acoustic features; prosodic feature; spectral feature;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495018
Filename :
5495018
Link To Document :
بازگشت