DocumentCode :
417140
Title :
Higher order cepstral moment normalization (HOCMN) for robust speech recognition
Author :
Hsu, Chang-Wen ; Lee, Lin-shan
Author_Institution :
Graduate Inst. of Commun. Eng., Nat. Taiwan Univ., Taiwan
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
Cepstral mean subtraction (CMS) and cepstral normalization (CN) have been popularly used to normalize the first and the second moments of cepstral coefficients, and proved to be very helpful for robust speech recognition (Furui, S. 1981; Viikki, O. and Laurila, K., 1998). A unified formulation for higher order cepstral moment normalization (HOCMN) is developed by extending the concept of CMS and CN to orders much higher than three. A whole family of normalization techniques for different orders is thus proposed. Preliminary experimental results based on Aurora 2.0 showed that the recognition accuracy can be significantly improved with this approach under all noisy conditions. For example, HOCMN(1,5,100) (normalization of the first, fifth and 100th order cepstral moments) is shown to offer an error rate reduction of 32.83% as compared to the conventional CN with a full-utterance processing interval, or an error rate reduction of 20.78% as compared to CN with a segmental processing interval.
Keywords :
acoustic noise; cepstral analysis; error statistics; random noise; speech recognition; cepstral mean subtraction; cepstral normalization; error rate reduction; higher order cepstral moment normalization; robust speech recognition; Cepstral analysis; Collision mitigation; Error analysis; Low-frequency noise; Mel frequency cepstral coefficient; Neural networks; Noise reduction; Probability density function; Robustness; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325956
Filename :
1325956
Link To Document :
بازگشت