• DocumentCode
    353737
  • Title

    Frame-discriminative and confidence-driven adaptation for LVCSR

  • Author

    Wallhoff, Frank ; Willett, Daniel ; Rigoll, Gerhard

  • Author_Institution
    Dept. of Comput. Sci., Gerhard-Mercator Univ. Duisberg, Germany
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1835
  • Abstract
    Maximum likelihood linear regression (MLLR) has become the most popular approach for adapting speaker-independent hidden Markov models to a specific speaker´s characteristics. However, it is well known, that discriminative training objectives outperform maximum likelihood training approaches, especially in cases where training data is very limited, as it always is the case in adaptation tasks. Therefore, this paper explores the application of a frame-based discriminative training objective for adaptation. It presents evaluations for supervised as well as for unsupervised adaption on the 1993 WSJ adaptation tests of native and non-native speakers. Relative improvements in word error rate of up to 25% could be measured compared to the MLLR adapted recognition systems. Along with unsupervised adaptation, the paper also presents the improvements achieved by the application of confidence measures. They provided an average relative improvement of 10% compared to ordinary unsupervised MLLR
  • Keywords
    adaptive systems; hidden Markov models; maximum likelihood estimation; speech recognition; 1993 WSJ adaptation tests; LVCSR; MLLR adapted recognition systems; confidence-driven adaptation; discriminative training objectives; frame-discriminative adaptation; large vocabulary continuous speech recognition; maximum likelihood linear regression; maximum likelihood training approaches; speaker-independent hidden Markov models; unsupervised adaptation; word error rate; Character recognition; Computer science; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Maximum likelihood linear regression; Parameter estimation; Speech recognition; Training data; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.862112
  • Filename
    862112