DocumentCode
323496
Title
A combination of discriminative and maximum likelihood techniques for noise robust speech recognition
Author
Laurila, Kari ; Vasilache, Marcel ; Viikki, Olli
Author_Institution
Speech & Audio Syst. Lab., Nokia Res. Center, Tampere, Finland
Volume
1
fYear
1998
fDate
12-15 May 1998
Firstpage
85
Abstract
We study how discriminative and maximum likelihood (ML) techniques should be combined in order to maximize the recognition accuracy of a speaker-independent automatic speech recognition (ASR) system that includes speaker adaptation. We compare two training approaches for the speaker-independent case and examine how well they perform together with four different speaker adaptation schemes. In a noise robust connected digit recognition task we show that the minimum classification error (MCE) training approach for speaker-independent modelling together with the Bayesian speaker adaptation scheme provide the highest classification accuracy over the whole lifespan of an ASR system. With the MCE training we are capable of reducing the recognition errors by 30% over the ML approach in the speaker-independent case. With the Bayesian speaker adaptation scheme we can further reduce the error rates by 62% using only as few as five adaptation utterances
Keywords
Bayes methods; error statistics; hidden Markov models; maximum likelihood estimation; noise; pattern classification; speech recognition; Bayesian speaker adaptation; HMM; MCE training; ML approach; adaptation utterances; car environment; classification accuracy; connected digit recognition task; discriminative techniques; error rate reduction; hands-free voice dialling; maximum likelihood techniques; minimum classification error training; noise robust speech recognition; recognition accuracy; recognition error reduction; speaker-independent automatic speech recognition; Automatic speech recognition; Bayesian methods; Hidden Markov models; Maximum likelihood estimation; Noise robustness; Speech enhancement; Speech recognition; Target recognition; Training data; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.674373
Filename
674373
Link To Document