Title :
Cepstral gain normalization for noise robust speech recognition
Author :
Yoshizawa, Shingo ; Hayasaka, Noboru ; Wada, Naoya ; Miyanaga, Yoshikazu
Author_Institution :
Graduate Sch. of Eng., Hokkaido Univ., Sapporo, Japan
Abstract :
The paper describes a robust speech recognition technique which normalizes cepstral gains in order to remove effects of additive noise. We assume that the effects can be expressed by an approximate model which consists of gain and DC components in log-spectrum. Accordingly, we propose cepstral gain normalization (CGN) which normalizes the gains by means of calculating maximum and minimum values of cepstral coefficients in speech frames. The proposed method can extract noise robust features without a priori knowledge and environmental adaptation because it is applied to both training and testing data. We have evaluated recognition performance under noisy environments using the Noisex-92 database and a 100 Japanese city names task. The CGN provides improvements of recognition accuracy at various SNRs compared with combinations of conventional methods.
Keywords :
acoustic noise; cepstral analysis; random noise; speech recognition; additive noise; cepstral coefficients; cepstral gain normalization; recognition accuracy; robust speech recognition; speech frames; Additive noise; Cepstral analysis; Cities and towns; Data mining; Feature extraction; Noise robustness; Spatial databases; Speech recognition; Testing; Working environment noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325959