DocumentCode
3423997
Title
Cepstral shape normalization (CSN) for robust speech recognition
Author
Du, Jun ; Wang, Ren-Hua
Author_Institution
Univ. of Sci. & Technol. of China, Hefei
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4389
Lastpage
4392
Abstract
In this paper, we propose a new feature normalization approach for robust speech recognition. It is found that the shape of speech feature distributions is changed in noisy environments compared with that in the clean condition. So cepstral shape normalization (CSN) which normalizes the shape of feature distributions is performed by exploiting an exponential factor. This method has been proven effective in noisy environments, especially under low SNRs. Experimental results show that the proposed method yields relative word error rate reductions of 38% and 25% on aurora2 and aurora3 databases, respectively, in comparing with those of the conventional mean and variance normalization (MVN). It is also shown CSN consistently outperforms other traditional methods, such as histogram equalization (HEQ) and higher order cepstral moment normalization (HOCMN).
Keywords
speech recognition; statistical analysis; cepstral shape normalization; feature normalization; higher order cepstral moment normalization; histogram equalization; mean normalization; speech feature distributions; speech recognition; variance normalization; Automatic speech recognition; Cepstral analysis; Gaussian noise; Histograms; Noise robustness; Noise shaping; Shape; Speech analysis; Speech recognition; Working environment noise; robust speech recognition; shape normalization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518628
Filename
4518628
Link To Document