DocumentCode :
3346702
Title :
Improved speech recognition via speaker stress directed classification
Author :
Womak, B.D. ; Hansen, John H L
Author_Institution :
Robust Speech Process. Lab., Duke Univ., Durham, NC, USA
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
53
Abstract :
Speech production variations due to perceptually induced stress contribute significantly to reduced speech processing performance. This study proposes an algorithm for estimation of the degree of perceptually induced stress. It is suggested that the resulting stress score could be integrated into speech processing algorithms to improve robustness in adverse conditions. First, results from a previous study motivate selection of a targeted set of speech features across phoneme and stress groups to improve stress classification performance. Analysis of articulatory, excitation, and cepstral based features is conducted using a previously established stressed speech database (SUSAS). Targeted feature sets are selected across ten stress conditions (including Apache helicopter, angry, clear, Lombard effect, loud, etc.). Next, an improved targeted feature stress classification system is developed and evaluated achieving rates of 91.01%. Finally, application of stress classification is incorporated into a stress directed speech recognition system. An improvement of +10.14% and +15.43% over conventionally trained neutral and multi-style trained recognizers is demonstrated using the new stress directed recognition system
Keywords :
cepstral analysis; feature extraction; speech processing; speech recognition; Apache helicopter; Lombard effect; angry speech; articulatory features; cepstral based features; clear speech; excitation features; loud speech; multistyle trained recognizers; neutral trained recognizers; perceptually induced stress estimation; phoneme groups; speaker stress directed classification; speech features; speech processing algorithms; speech processing performance; speech production variations; speech recognition; stress classification performance; stress score; stressed speech database; Cepstral analysis; Helicopters; Laboratories; Robustness; Spatial databases; Speech analysis; Speech processing; Speech recognition; Stress; Target recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.540288
Filename :
540288
Link To Document :
بازگشت