DocumentCode :
2853699
Title :
Neural Networks and TEO Features for an Automatic Recognition of Stress in Spontaneous Speech
Author :
He, Ling ; Lech, Margaret ; Maddage, Namunu C. ; Allen, Nicholas
Author_Institution :
Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia
Volume :
2
fYear :
2009
fDate :
14-16 Aug. 2009
Firstpage :
227
Lastpage :
231
Abstract :
This study presents automatic stress recognition methods based on acoustic speech analysis. Novel approaches to feature extraction based on the nonlinear Teager energy operator (TEO) calculated within critical bands, discrete wavelet transform bands, and wavelet packet bands are presented. The classification process was performed using two types of neural networks: the multilayer perceptron neural network (MLPNN) and the probabilistic neural network (PNN). The classification efficiency was tested using the actual stress dataset from the SUSAS database. The speech recordings were made by 15 speakers (8 females and 7 males) reading a list of 35 words under three actual conditions: high stress, low stress, and neutral. The best overall performance was observed for the features extracted using the TEO parameters calculated within perceptual wavelet packet bands(TEO-PWP). Depending on the type of mother wavelet, the correct classification scores for the PWP features ranged from 71.24% to 91.56% (using the MLPNN classifier), and from 86.63% to 93.67% (using the PNN). The PNN classifier outperformed the MLPNN classification method.
Keywords :
database management systems; discrete wavelet transforms; feature extraction; multilayer perceptrons; speech recognition; MLPNN; SUSAS database; TEO features; automatic recognition; automatic stress recognition methods; discrete wavelet transform; feature extraction; multilayer perceptron neural network; neural networks; nonlinear Teager energy operator; perceptual wavelet packet bands; probabilistic neural network; spontaneous speech stress; stress dataset; wavelet packet bands; Automatic speech recognition; Discrete wavelet transforms; Feature extraction; Multi-layer neural network; Multilayer perceptrons; Neural networks; Speech analysis; Stress; Testing; Wavelet packets; MLPNN; PNN; TEO features; speech classification; stress and emotion recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation, 2009. ICNC '09. Fifth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3736-8
Type :
conf
DOI :
10.1109/ICNC.2009.56
Filename :
5365560
Link To Document :
بازگشت