Title :
Multivariate-Gaussian-based cepstral normalization for robust speech recognition
Author :
P.J. Moreno;B. Raj;E. Gouvea;R.M. Stern
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
We introduce a new family of environmental compensation algorithms called multivariate gaussian based cepstral normalization (RATZ). RATZ assumes that the effects of unknown noise and filtering on speech features can be compensated by corrections to the mean and variance of components of Gaussian mixtures, and an efficient procedure for estimating the correction factors is provided. The RATZ algorithm can be implemented to work with or without the use of "stereo" development data that had been simultaneously recorded in the training and testing environments. "Blind" RATZ partially overcomes the loss of information that would have been provided by stereo training through the use of a more accurate description of how noisy environments affect clean speech. We evaluate the performance of the two RATZ algorithms using the CMU SPHINX-II system on the alphanumeric census database and compare their performance with that of previous environmental-robustness developed at CMU.
Keywords :
"Cepstral analysis","Robustness","Speech recognition","Speech enhancement","Degradation","Working environment noise","Additive noise","Speech analysis","Statistics","Databases"
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479292