DocumentCode
312178
Title
Iterative unsupervised adaptation using maximum likelihood linear regression
Author
Woodland, P.C. ; Pye, D. ; Gales, M.J.F.
Author_Institution
Dept. of Eng., Cambridge Univ., UK
Volume
2
fYear
1996
fDate
3-6 Oct 1996
Firstpage
1133
Abstract
Maximum likelihood linear regression (MLLR) is a parameter transformation technique for both speaker and environment adaptation. In this paper, the iterative use of MLLR is investigated in the context of large-vocabulary speaker-independent transcription of both noise-free and noisy data. It is shown that iterative application of MLLR can be beneficial especially in situations of severe mismatch. When word lattices are used, it is important that the lattices contain the correct transcription, and it is shown that global MLLR based on rough initial transcriptions of the data can be very useful in generating high-quality lattices. MLLR can also be used in an iterative fashion to refine the transcriptions of the test data and to adapt models based on the current transcriptions. These techniques were used by the HTK large-vocabulary speech recognition system for the November 1995 ARPA H3 evaluation. It is shown that iterative-application MLLR proved to be very effective prior to lattice generation and for iterative refinement
Keywords
adaptive systems; iterative methods; maximum likelihood estimation; speech recognition; unsupervised learning; vocabulary; ARPA H3 evaluation; HTK large-vocabulary speech recognition system; environment adaptation; iterative refinement; iterative unsupervised adaptation; large-vocabulary speaker-independent transcription; maximum likelihood linear regression; noise-free data; noisy data; parameter transformation technique; rough initial transcriptions; severe mismatch; speaker adaptation; word lattice generation; Additive noise; Lattices; Linear regression; Maximum likelihood linear regression; Microphones; Speech enhancement; Speech recognition; Testing; Vocabulary; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607806
Filename
607806
Link To Document