• DocumentCode
    1749714
  • Title

    On-line learning of language models with word error probability distributions

  • Author

    Gretter, Roberto ; Riccardi, Giuseppe

  • Author_Institution
    Trento Univ., Italy
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    557
  • Abstract
    We are interested in the problem of learning stochastic language models on-line (without speech transcriptions) for adaptive speech recognition and understanding. We propose an algorithm to adapt to variations in the language model distributions based on speech input only and without its true transcription. The on-line probability estimate is defined. as a function of the prior and word error distributions. We show the effectiveness of word-lattice based error probability distributions in terms of receiver operating characteristics (ROC) curves and word accuracy. We apply the new estimates Padapt (w) to the task of adapting on-line an initial large vocabulary trigram language model and show improvement in word accuracy with respect to the baseline speech recognizer
  • Keywords
    error statistics; natural languages; probability; speech recognition; unsupervised learning; ROC curves; adaptive speech recognition; baseline speech recognizer; large vocabulary trigram language model; online learning; receiver operating characteristics curves; speech understanding; stochastic language models; word accuracy; word error probability distributions; Accuracy; Error analysis; Error probability; Lattices; Natural languages; Probability distribution; Speech recognition; Stochastic processes; Topology; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940892
  • Filename
    940892