• DocumentCode
    312178
  • Title

    Iterative unsupervised adaptation using maximum likelihood linear regression

  • Author

    Woodland, P.C. ; Pye, D. ; Gales, M.J.F.

  • Author_Institution
    Dept. of Eng., Cambridge Univ., UK
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    1133
  • Abstract
    Maximum likelihood linear regression (MLLR) is a parameter transformation technique for both speaker and environment adaptation. In this paper, the iterative use of MLLR is investigated in the context of large-vocabulary speaker-independent transcription of both noise-free and noisy data. It is shown that iterative application of MLLR can be beneficial especially in situations of severe mismatch. When word lattices are used, it is important that the lattices contain the correct transcription, and it is shown that global MLLR based on rough initial transcriptions of the data can be very useful in generating high-quality lattices. MLLR can also be used in an iterative fashion to refine the transcriptions of the test data and to adapt models based on the current transcriptions. These techniques were used by the HTK large-vocabulary speech recognition system for the November 1995 ARPA H3 evaluation. It is shown that iterative-application MLLR proved to be very effective prior to lattice generation and for iterative refinement
  • Keywords
    adaptive systems; iterative methods; maximum likelihood estimation; speech recognition; unsupervised learning; vocabulary; ARPA H3 evaluation; HTK large-vocabulary speech recognition system; environment adaptation; iterative refinement; iterative unsupervised adaptation; large-vocabulary speaker-independent transcription; maximum likelihood linear regression; noise-free data; noisy data; parameter transformation technique; rough initial transcriptions; severe mismatch; speaker adaptation; word lattice generation; Additive noise; Lattices; Linear regression; Maximum likelihood linear regression; Microphones; Speech enhancement; Speech recognition; Testing; Vocabulary; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607806
  • Filename
    607806