DocumentCode :
3162001
Title :
Feature compensation employing online GMM adaptation for speech recognition in unknown severely adverse environments
Author :
Kim, Wooil ; Hansen, John H L
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4121
Lastpage :
4124
Abstract :
This study proposes an effective feature compensation-method to improve speech recognition in real-life speech conditions, where (i) severe background noise and channel distortion simultaneously exist, (ii) no development data is available, and (iii) clean data for ASR training and the latent clean speech in the test data are mismatched in the acoustic structure. The proposed feature compensation method employs an online GMM adaptation procedure which is based on MLLR, and a minimum statistics replacement technique for non-speech segments. The DARPA Tank corpus is used for performance evaluation, which includes severe real-life noisy conditions. The clean Broadcast News (BN) corpus is used for training the speech recognition system in this study. Experimental results show that the proposed feature compensation scheme outperforms GMM-based FMLLR and the ETSI AFE for DARPA Tank data, achieving a +5.56% relative improvement compared to FMLLR. These results demonstrate that the proposed feature compensation scheme is effective at improving speech recognition performance in unknown real-life adverse environments.
Keywords :
Gaussian distribution; maximum likelihood estimation; regression analysis; speech recognition; ASR training; DARPA tank corpus; ETSI AFE; Gaussian mixture model; acoustic structure; background noise; channel distortion; clean broadcast news corpus; feature compensation; latent clean speech; maximum likelihood linear regression; online GMM adaptation; real-life speech conditions; speech recognition; statistics replacement; unknown severely adverse environments; Hidden Markov models; Noise measurement; Speech; Speech recognition; Telecommunication standards; Training; Vectors; DARPA Tank corpus; GMMadaptation; feature compensation; minimum statistics replacement; robust speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288825
Filename :
6288825
Link To Document :
بازگشت