• DocumentCode
    337468
  • Title

    Speech recognition experiments using multi-span statistical language models

  • Author

    Bellegarda, Jerome R.

  • Author_Institution
    Spoken Language Group, Apple Comput. Inc., Cupertino, CA, USA
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    717
  • Abstract
    A multi-span framework was proposed to integrate the various constraints, both local and global, that are present in the language. In this approach, local constraints are captured via n-gram language modeling, while global constraints are taken into account through the use of latent semantic analysis. The performance of the resulting multi-span language models, as measured by the perplexity, has been shown to compare favorably with the corresponding n-gram performance. This paper reports on actual speech recognition experiments, and shows that word error rate is also substantially reduced. On a subset of the Wall Street Journal speaker-independent, 20,000-word vocabulary, continuous speech task, the multi-span framework resulted in a reduction in average word error rate of up to 17%
  • Keywords
    grammars; speech recognition; statistical analysis; Wall Street Journal speaker-independent vocabulary; average word error rate reduction; continuous speech task; global constraints; latent semantic analysis; local constraints; multi-span statistical language models; n-gram language modeling; n-gram performance; performance; perplexity; speech recognition experiments; word error rate; Data mining; Displays; Error analysis; Frequency; History; Natural languages; Robustness; Speech recognition; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759767
  • Filename
    759767