DocumentCode :
417143
Title :
Cepstral gain normalization for noise robust speech recognition
Author :
Yoshizawa, Shingo ; Hayasaka, Noboru ; Wada, Naoya ; Miyanaga, Yoshikazu
Author_Institution :
Graduate Sch. of Eng., Hokkaido Univ., Sapporo, Japan
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
This paper describes a robust speech recognition technique that normalizes cepstral gains to remove the effects of additive noise. We assume that these effects can be expressed by an approximate model consisting of gain and DC components in the log-spectrum. Accordingly, we propose cepstral gain normalization (CGN), which normalizes the gains by calculating the maximum and minimum values of the cepstral coefficients over the speech frames. Because it is applied to both training and testing data, the proposed method can extract noise-robust features without a priori knowledge or environmental adaptation. We evaluated recognition performance in noisy environments using the Noisex-92 database and a recognition task of 100 Japanese city names. CGN improves recognition accuracy at various SNRs compared with combinations of conventional methods.
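The normalization idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract only states that gains are normalized using the maximum and minimum cepstral values over the speech frames, so the exact formula here is an assumption. The sketch assumes the DC component is removed by subtracting the per-coefficient mean and that the gain is estimated as (max - min). The function name `cepstral_gain_normalize` and the `eps` parameter are hypothetical.

```python
from typing import List


def cepstral_gain_normalize(frames: List[List[float]],
                            eps: float = 1e-12) -> List[List[float]]:
    """Normalize each cepstral coefficient track over an utterance.

    frames[t][i] is cepstral coefficient i at speech frame t.
    ASSUMPTION: DC is removed by mean subtraction and the gain is taken
    as (max - min) of each track; the paper's exact formula may differ.
    """
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    normalized = [[0.0] * n_coeffs for _ in range(n_frames)]
    for i in range(n_coeffs):
        # Collect coefficient i across all frames of the utterance.
        track = [frame[i] for frame in frames]
        mean = sum(track) / n_frames
        gain = max(track) - min(track)
        if gain < eps:
            gain = 1.0  # constant track: keep centered values unscaled
        for t in range(n_frames):
            normalized[t][i] = (track[t] - mean) / gain
    return normalized
```

Under these assumptions, a track distorted by a positive gain and a DC offset maps to the same normalized values as the clean track, which is the invariance that the gain-plus-DC model in the abstract suggests.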
Keywords :
acoustic noise; cepstral analysis; random noise; speech recognition; additive noise; cepstral coefficients; cepstral gain normalization; recognition accuracy; robust speech recognition; speech frames; Additive noise; Cepstral analysis; Cities and towns; Data mining; Feature extraction; Noise robustness; Spatial databases; Speech recognition; Testing; Working environment noise;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325959
Filename :
1325959