DocumentCode :
3488138
Title :
Classification of stress in speech using linear and nonlinear features
Author :
Nwe, Tin Lay ; Foo, Say Wei ; De Silva, Liyanage C.
Volume :
2
fYear :
2003
fDate :
6-10 April 2003
Abstract :
Three systems for the classification of stress in speech are proposed. The first system makes use of linear short time log frequency power coefficients (LFPC), the second employs a Teager energy operator (TEO) based nonlinear frequency domain LFPC features (NFD-LFPC) and the third uses TEO based nonlinear time domain LFPC features (NTD-LFPC). The systems were tested using the SUSAS (speech under simulated and actual stress) database to categorize five stress conditions individually. Results show that the system using LFPC gives the highest accuracy, followed by the system using NFD-LFPC features, while the system using NTD-LFPC features gives the worst performance. For the system using linear LFPC features, average accuracy of 84% and best accuracy of 95% were obtained in classifying five stress categories.
Keywords :
emotion recognition; feature extraction; frequency-domain analysis; hidden Markov models; speech recognition; time-domain analysis; HMM; Teager energy operator; emotional states; feature extraction; hidden Markov model; linear short time log frequency power coefficients; nonlinear frequency domain features; nonlinear time domain features; speech stress classification; Drives; Frequency; Hidden Markov models; Power engineering and energy; Power engineering computing; Speech analysis; Speech enhancement; Speech recognition; Stress; Tin;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202281
Filename :
1202281
Link To Document :
بازگشت