DocumentCode
417143
Title
Cepstral gain normalization for noise robust speech recognition
Author
Yoshizawa, Shingo ; Hayasaka, Noboru ; Wada, Naoya ; Miyanaga, Yoshikazu
Author_Institution
Graduate Sch. of Eng., Hokkaido Univ., Sapporo, Japan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
The paper describes a robust speech recognition technique which normalizes cepstral gains in order to remove effects of additive noise. We assume that the effects can be expressed by an approximate model which consists of gain and DC components in log-spectrum. Accordingly, we propose cepstral gain normalization (CGN) which normalizes the gains by means of calculating maximum and minimum values of cepstral coefficients in speech frames. The proposed method can extract noise robust features without a priori knowledge and environmental adaptation because it is applied to both training and testing data. We have evaluated recognition performance under noisy environments using the Noisex-92 database and a 100 Japanese city names task. The CGN provides improvements of recognition accuracy at various SNRs compared with combinations of conventional methods.
Keywords
acoustic noise; cepstral analysis; random noise; speech recognition; additive noise; cepstral coefficients; cepstral gain normalization; recognition accuracy; robust speech recognition; speech frames; Additive noise; Cepstral analysis; Cities and towns; Data mining; Feature extraction; Noise robustness; Spatial databases; Speech recognition; Testing; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325959
Filename
1325959
Link To Document