• DocumentCode
    1290644
  • Title

    Variable n-grams and extensions for conversational speech language modeling

  • Author

    Siu, Manhung ; Ostendorf, Mari

  • Author_Institution
    Hong Kong Univ. of Sci. & Technol., Kowloon, Hong Kong
  • Volume
    8
  • Issue
    1
  • fYear
    2000
  • fDate
    1/1/2000 12:00:00 AM
  • Firstpage
    63
  • Lastpage
    75
  • Abstract
    Recent progress in variable n-gram language modeling provides an efficient representation of n-gram models and makes training of higher order n grams possible. We apply the variable n-gram design algorithm to conversational speech, extending the algorithm to learn skips and context-dependent classes to handle conversational speech characteristics such as filler words, repetitions, and other disfluencies. Experiments show that using the extended variable n-gram results in a language model that captures 4-gram context with less than half the parameters of a standard trigram while also improving the test perplexity and recognition accuracy
  • Keywords
    computational linguistics; natural languages; speech recognition; context-dependent classes; conversational speech language modeling; experiments; speech recognition; training; trigram; variable n-gram language modeling; Algorithm design and analysis; Context modeling; Costs; Helium; History; Natural languages; Parameter estimation; Speech recognition; Testing; Vocabulary;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.817454
  • Filename
    817454