• DocumentCode
    294550
  • Title

    An integrated grammar/bigram language model using path scores

  • Author

    Lloyd-Thomas, Harvey ; Wright, Jerry H. ; Jones, Gareth J F

  • Author_Institution
    Ensigma Ltd., Chepstow, UK
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    173
  • Abstract
    This paper describes a language model in which context-free grammar rules are integrated into an n-gram framework, complementing it instead of attempting to replace it. This releases the grammar from the aim of parsing whole sentences (which is often undesirable as well as unrealistic), enabling it to be employed selectively in modelling phrases that are identifiable within a flow of speech. Algorithms for model training and for sentence scoring and interpretation are described. All are based on the principle of summing over paths that span the sentence, but the implementation is node-based for efficiency. Perplexity results for this system (using a hierarchy of grammars from empty to full-coverage) are compared with those for n-gram models, and the system is used for re-scoring N-best sentence lists for a speaker-independent recogniser.
  • Keywords
    context-free grammars; natural languages; speech processing; speech recognition; context-free grammar rules; grammar hierarchy; integrated grammar/bigram language model; model training algorithms; n-gram framework; n-gram models; path scores; perplexity results; phrases modelling; sentence interpretation; sentence scoring; speaker-independent recogniser; Context modeling; Hidden Markov models; Joining processes; Mathematics; Probability; Smoothing methods; Speech; Training data
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    1995 International Conference on Acoustics, Speech, and Signal Processing (ICASSP-95)
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479392
  • Filename
    479392