Title :
Robust speech recognition using multiple prior models for speech reconstruction
Author :
Narayanan, Arun ; Zhao, Xiaojia ; Wang, DeLiang ; Fosler-Lussier, Eric
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
Prior models of speech have been used in robust automatic speech recognition to enhance noisy speech. Typically, a single prior model is trained by pooling the entire training data. In this paper we propose to train multiple prior models of speech instead of a single prior model. The prior models can be trained based on distinct characteristics of speech. In this study, they are trained based on voicing characteristics. The trained prior models are then used to reconstruct noisy speech. Significant improvements are obtained on the Aurora-4 robust speech recognition task when multiple priors are used; in conjunction with an uncertainty transform technique, multiple priors yield a 13.7% absolute improvement in the average word error rate over directly recognizing noisy speech.
Keywords :
signal reconstruction; speech enhancement; speech recognition; Aurora-4 robust speech recognition; multiple prior models; noisy speech enhancement; speech reconstruction; voicing characteristics; Hidden Markov models; Noise; Noise measurement; Robustness; Speech; Speech recognition; Uncertainty; Aurora-4; CASA; Robust ASR; feature reconstruction; uncertainty transform;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947429