• DocumentCode
    573174
  • Title

    Developing a hybrid language model for open vocabulary automatic speech recognition in a lecture speech task

  • Author

    Rondeau, Marc-Antoine ; Rose, Richard

  • Author_Institution
    Dept. of Electr. & Comput. Eng., McGill Univ., Montréal, QC, Canada
  • fYear
    2012
  • fDate
    2-5 July 2012
  • Firstpage
    114
  • Lastpage
    119
  • Abstract
    This paper addresses the problem of open vocabulary automatic speech recognition (ASR) using hybrid statistical language models (LMs). Hybrid LMs differ from closed vocabulary LMs in that the word level lexicon is augmented with an inventory of sub-lexical units (SLUs). The procedures used for selecting these SLUs and expanding out-of-vocabulary (OOV) words according to the SLUs is presented in the paper. The open-vocabulary ASR performance obtained using these techniques is presented for a lecture speech task domain.
  • Keywords
    computer aided instruction; natural language processing; speech recognition; statistical analysis; vocabulary; OOV words; SLU; closed vocabulary LM; hybrid statistical language models; lecture speech task; open vocabulary automatic speech recognition; open-vocabulary ASR performance; out-of-vocabulary words; sublexical units; word level lexicon; Accuracy; Entropy; Lattices; Probabilistic logic; Speech; Training; Vocabulary; Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
  • Conference_Location
    Montreal, QC
  • Print_ISBN
    978-1-4673-0381-1
  • Electronic_ISBN
    978-1-4673-0380-4
  • Type

    conf

  • DOI
    10.1109/ISSPA.2012.6310464
  • Filename
    6310464