Title :
Double Gaussian based feature normalization for robust speech recognition
Author :
Liu, Bo ; Li-Rong Dai ; Li, Jin-Yu ; Wang, Ren-Hua
Author_Institution :
Univ. of Sci. & Technol. of China, Anhui, China
Abstract :
In this paper, a new feature normalization approach, based on the cumulative density function (CDF) matching principle, is proposed. Since speech features in noisy environments usually follow bimodal distributions, we fully utilize this characteristic by representing the CDF of the features with a double Gaussian model. A feature normalization process is performed according to the estimated CDF. The experimental results on the Aurora2 database show that the performance of our method is much better than that of the conventional mean and variance normalization (MVN) method, and comparable to that of the method combining spectral subtraction and histogram equalization (HE). Moreover, further improvement has been gained by combining our method with a simple temporal feature smoothing process. This result suggests that our new method has the potential to be integrated with other techniques to provide even better performance.
Keywords :
Gaussian distribution; higher order statistics; signal denoising; smoothing methods; speech recognition; CDF matching principle; bimodal distribution; cumulative density function; double Gaussian based feature normalization; noisy environment speech features; robust speech recognition; temporal feature smoothing process; Density functional theory; Helium; Histograms; Parametric statistics; Robustness; Spatial databases; Speech enhancement; Speech recognition; Testing; Working environment noise;
Conference_Titel :
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN :
0-7803-8678-7
DOI :
10.1109/CHINSL.2004.1409634