DocumentCode :
353737
Title :
Frame-discriminative and confidence-driven adaptation for LVCSR
Author :
Wallhoff, Frank ; Willett, Daniel ; Rigoll, Gerhard
Author_Institution :
Dept. of Comput. Sci., Gerhard-Mercator Univ. Duisberg, Germany
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1835
Abstract :
Maximum likelihood linear regression (MLLR) has become the most popular approach for adapting speaker-independent hidden Markov models to a specific speaker´s characteristics. However, it is well known, that discriminative training objectives outperform maximum likelihood training approaches, especially in cases where training data is very limited, as it always is the case in adaptation tasks. Therefore, this paper explores the application of a frame-based discriminative training objective for adaptation. It presents evaluations for supervised as well as for unsupervised adaption on the 1993 WSJ adaptation tests of native and non-native speakers. Relative improvements in word error rate of up to 25% could be measured compared to the MLLR adapted recognition systems. Along with unsupervised adaptation, the paper also presents the improvements achieved by the application of confidence measures. They provided an average relative improvement of 10% compared to ordinary unsupervised MLLR
Keywords :
adaptive systems; hidden Markov models; maximum likelihood estimation; speech recognition; 1993 WSJ adaptation tests; LVCSR; MLLR adapted recognition systems; confidence-driven adaptation; discriminative training objectives; frame-discriminative adaptation; large vocabulary continuous speech recognition; maximum likelihood linear regression; maximum likelihood training approaches; speaker-independent hidden Markov models; unsupervised adaptation; word error rate; Character recognition; Computer science; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Maximum likelihood linear regression; Parameter estimation; Speech recognition; Training data; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.862112
Filename :
862112
Link To Document :
بازگشت