DocumentCode
336769
Title
Speaker normalized spectral subband parameters for noise robust speech recognition
Author
Tsuge, Satoru ; Fukuda, Toshio ; Singer, Harald
Author_Institution
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
Volume
1
fYear
1999
fDate
15-19 Mar 1999
Firstpage
285
Abstract
This paper proposes speaker normalized spectral subband centroids (SSCs) as supplementary features in noise environment speech recognition. SSCs are computed as frequency centroids for each subband from the power spectrum of the speech signal. Since the conventional SSCs depend on the formant frequencies of a speaker, we introduce a speaker normalization technique into SSC computation to reduce the speaker variability. Experimental results on spontaneous speech recognition show that the speaker normalized SSCs are more useful as supplementary features for improving the recognition performance than the conventional SSCs
Keywords
feature extraction; noise; parameter estimation; spectral analysis; speech recognition; experimental results; formant frequencies; frequency centroids; noise robust speech recognition; power spectrum; recognition performance; speaker normalized spectral subband centroids; speaker normalized spectral subband parameters; speaker variability reduction; speech signal; spontaneous speech recognition; Databases; Dynamic range; Feature extraction; Frequency conversion; Noise robustness; Noise shaping; Signal to noise ratio; Speech recognition; Testing; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location
Phoenix, AZ
ISSN
1520-6149
Print_ISBN
0-7803-5041-3
Type
conf
DOI
10.1109/ICASSP.1999.758118
Filename
758118
Link To Document