DocumentCode :
2839128
Title :
Double Gaussian based feature normalization for robust speech recognition
Author :
Liu, Bo ; Dai, Li-Rong ; Li, Jin-Yu ; Wang, Ren-Hua
Author_Institution :
Univ. of Sci. & Technol. of China, Anhui, China
fYear :
2004
fDate :
15-18 Dec. 2004
Firstpage :
253
Lastpage :
256
Abstract :
In this paper, a new feature normalization approach, based on the cumulative density function (CDF) matching principle, is proposed. Since speech features in noisy environments usually follow bimodal distributions, we fully utilize this characteristic by representing the CDF of the features with a double Gaussian model. A feature normalization process is performed according to the estimated CDF. The experimental results on the Aurora2 database show that the performance of our method is much better than that of the conventional mean and variance normalization (MVN) method, and comparable to that of the method combining spectral subtraction and histogram equalization (HE). Moreover, further improvement has been gained by combining our method with a simple temporal feature smoothing process. This result suggests that our new method has the potential to be integrated with other techniques to provide even better performance.
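The CDF-matching idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the double Gaussian model is estimated or which reference distribution is used, so the code below assumes a plain EM fit of a two-component 1-D Gaussian mixture and a standard normal target. The function names (`fit_double_gaussian`, `mixture_cdf`, `normalize`) are hypothetical.

```python
import math
import random
from statistics import NormalDist


def fit_double_gaussian(x, iters=50):
    """Fit a two-component 1-D Gaussian mixture with plain EM (assumed estimator)."""
    xs = sorted(x)
    n = len(xs)
    half = n // 2
    # Initialise the two means from the lower and upper halves of the sorted data.
    mu = [sum(xs[:half]) / half, sum(xs[half:]) / (n - half)]
    mean_all = sum(xs) / n
    var0 = max(sum((v - mean_all) ** 2 for v in xs) / n, 1e-6)
    var = [var0, var0]
    w = [0.5, 0.5]
    for _ in range(iters):
        comps = [NormalDist(mu[k], math.sqrt(var[k])) for k in range(2)]
        # E-step: posterior probability of each component for every sample.
        resp = []
        for v in x:
            p = [w[k] * comps[k].pdf(v) for k in range(2)]
            s = (p[0] + p[1]) or 1e-300
            resp.append((p[0] / s, p[1] / s))
        # M-step: re-estimate weights, means, and variances (variance is floored).
        for k in range(2):
            nk = max(sum(r[k] for r in resp), 1e-12)
            w[k] = nk / n
            mu[k] = sum(r[k] * v for r, v in zip(resp, x)) / nk
            var[k] = max(sum(r[k] * (v - mu[k]) ** 2 for r, v in zip(resp, x)) / nk, 1e-6)
    return w, mu, var


def mixture_cdf(v, w, mu, var):
    """CDF of the fitted double Gaussian model at value v."""
    return sum(w[k] * NormalDist(mu[k], math.sqrt(var[k])).cdf(v) for k in range(2))


def normalize(x, w, mu, var, eps=1e-6):
    """CDF matching: map each value through the model CDF, then through the
    inverse CDF of the reference distribution (standard normal, an assumption)."""
    ref = NormalDist(0.0, 1.0)
    out = []
    for v in x:
        u = min(max(mixture_cdf(v, w, mu, var), eps), 1.0 - eps)
        out.append(ref.inv_cdf(u))
    return out


# Synthetic bimodal feature track standing in for one cepstral coefficient.
random.seed(0)
feature = [random.gauss(-3.0, 0.5) if random.random() < 0.4 else random.gauss(2.0, 1.0)
           for _ in range(500)]
weights, means, variances = fit_double_gaussian(feature)
normalized = normalize(feature, weights, means, variances)
```

In a recognizer, a transform of this kind would presumably be estimated and applied separately for each feature dimension. Because the mapping is monotonic, it preserves the ordering of the feature values while reshaping their distribution toward the reference.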
Keywords :
Gaussian distribution; higher order statistics; signal denoising; smoothing methods; speech recognition; CDF matching principle; bimodal distribution; cumulative density function; double Gaussian based feature normalization; noisy environment speech features; robust speech recognition; temporal feature smoothing process; Density functional theory; Helium; Histograms; Parametric statistics; Robustness; Spatial databases; Speech enhancement; Speech recognition; Testing; Working environment noise;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN :
0-7803-8678-7
Type :
conf
DOI :
10.1109/CHINSL.2004.1409634
Filename :
1409634