Title :
Classification of stress in speech using linear and nonlinear features
Author :
Nwe, Tin Lay ; Foo, Say Wei ; De Silva, Liyanage C.
Abstract :
Three systems for the classification of stress in speech are proposed. The first system makes use of linear short time log frequency power coefficients (LFPC), the second employs a Teager energy operator (TEO) based nonlinear frequency domain LFPC features (NFD-LFPC) and the third uses TEO based nonlinear time domain LFPC features (NTD-LFPC). The systems were tested using the SUSAS (speech under simulated and actual stress) database to categorize five stress conditions individually. Results show that the system using LFPC gives the highest accuracy, followed by the system using NFD-LFPC features, while the system using NTD-LFPC features gives the worst performance. For the system using linear LFPC features, average accuracy of 84% and best accuracy of 95% were obtained in classifying five stress categories.
Keywords :
emotion recognition; feature extraction; frequency-domain analysis; hidden Markov models; speech recognition; time-domain analysis; HMM; Teager energy operator; emotional states; feature extraction; hidden Markov model; linear short time log frequency power coefficients; nonlinear frequency domain features; nonlinear time domain features; speech stress classification; Drives; Frequency; Hidden Markov models; Power engineering and energy; Power engineering computing; Speech analysis; Speech enhancement; Speech recognition; Stress; Tin;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1202281