• DocumentCode
    2793495
  • Title

    Information retrieval methods for automatic speech recognition

  • Author

    Xiao, Xiaoqiang ; Droppo, Jasha ; Acero, Alex

  • Author_Institution
    Dept. of Electr. Eng., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5550
  • Lastpage
    5553
  • Abstract
    In this paper, we use information retrieval (IR) techniques to improve a speech recognition (ASR) system. The potential benefits include improved speed, accuracy, and scalability. Where conventional HMM-based speech recognition systems decode words directly, our IR-based system first decodes subword units. These are then mapped to a target word by the IR system. In this decoupled system, the IR serves as a lightweight, data-driven pronunciation model. Our proposed method is evaluated in the Windows Live Search for Mobile (WLS4M) task, and our best system has 12% fewer errors than a comparable HMM classifier. We show that even using an inexpensive IR weighting scheme (TF-IDF) yields a 3% relative error rate reduction while maintaining all of the advantages of the IR approach.
  • Keywords
    hidden Markov models; information retrieval; speech recognition; HMM; automatic speech recognition; data-driven pronunciation model; decoupled system; hidden Markov models; information retrieval weighting scheme; windows live search for mobile task; Automatic speech recognition; Background noise; Decoding; Dictionaries; Engines; Feature extraction; Hidden Markov models; Information retrieval; Power system modeling; Speech recognition; Direct Modeling; Information Retrieval; Language Model; Speech Recognition; Vector Space Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495229
  • Filename
    5495229