• DocumentCode
    2254008
  • Title

    Language modeling using x-grams

  • Author

    Bonafonte, Antonio ; Marino, José B.

  • Author_Institution
    Univ. Politecnica de Catalunya, Barcelona, Spain
  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    394
  • Abstract
    In this paper, an extension of n-grams, called x-grams, is proposed. In this extension, the memory of the model (n) is not fixed a priori. Instead, large memories are accepted first, and merging criteria are then applied to reduce the complexity and to ensure reliable estimations. The results show how the perplexity obtained with x-grams is smaller than that of n-grams. Furthermore, the complexity is smaller than trigrams and can become close to bigrams
  • Keywords
    computational complexity; computational linguistics; grammars; linguistics; merging; modelling; natural languages; nomograms; probability; bigrams; complexity reduction; grammatical inference; language modeling; large memories; merging criteria; model memory; n-grams; perplexity; probability estimation; reliable estimations; trigrams; x-grams; Automata; Character recognition; Contracts; History; Merging; Out of order; Parameter estimation; Proposals; Speech recognition; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607137
  • Filename
    607137