DocumentCode :
2850522
Title :
Stress Detection Using Speech Spectrograms and Sigma-pi Neuron Units
Author :
He, Ling ; Lech, Margaret ; Maddage, Namunu C. ; Allen, Nicholas
Author_Institution :
Sch. of Electr. & Comput. Eng., RMIT Univ., Melbourne, VIC, Australia
Volume :
2
fYear :
2009
fDate :
14-16 Aug. 2009
Firstpage :
260
Lastpage :
264
Abstract :
This paper presents a new system for automatic stress detection in speech. In the process of feature extraction speech spectrograms were used as the primary features. The sigma-pi neuron cells were then employed to derive the secondary features. The analysis was performed at three alternative sets of analytical frequency bands: critical bands, Bark scale bands and equivalent rectangular bandwidth (ERB) scale bands. The presented algorithm was tested using actual stressful speech utterances from SUSAS (Speech Under Simulated and Actual Stress) database on the vowel-based level. The automatic stress-level classification was implemented using Gaussian mixture model (GMM) and k-nearest neighbor (KNN) classifiers. The strongest effect on the classification results was observed when selecting the type of frequency bands. The ERB scale provided the highest classification results ranging from 67.84% to 73.76%. The classification results did not differ between data sets containing specific types of vowels and data sets containing mixtures of vowels. This indicates that the proposed method can be applied to voiced speech in speech independent conditions.
Keywords :
Gaussian processes; feature extraction; pattern classification; speech processing; Bark scale bands; Gaussian mixture model; SUSAS database; Speech Under Simulated and Actual Stress database; analytical frequency bands; automatic stress detection; critical bands; equivalent rectangular bandwidth scale bands; feature extraction; k-nearest neighbor classifiers; sigma-pi neuron cells; speech spectrograms; Bandwidth; Feature extraction; Frequency; Neurons; Performance analysis; Spatial databases; Spectrogram; Speech processing; Stress; Testing; sigma-pi neural units; spectrogram analysis; speech classification; stress recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Computation, 2009. ICNC '09. Fifth International Conference on
Conference_Location :
Tianjin
Print_ISBN :
978-0-7695-3736-8
Type :
conf
DOI :
10.1109/ICNC.2009.59
Filename :
5365356
Link To Document :
بازگشت