• DocumentCode
    323518
  • Title

    Initial language models for spoken dialogue systems

  • Author

    Kellner, Andreas

  • Author_Institution
    Philips GmbH Forschungslab., Aachen, Germany
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    185
  • Abstract
    The estimation of initial language models for new applications of spoken dialogue systems without large task specific training corpora is becoming an increasingly important issue. This paper investigates two different approaches in which the task-specific knowledge contained in the language understanding grammar is exploited in order to generate n-gram language models for the speech recognizer: The first uses class-based language models for which the word-classes are automatically derived from the grammar. In the second approach, language models are estimated on artificial corpora which have been created from the understanding grammar. The application of fill-up techniques allows the combination of the strengths of both approaches and leads to a language model which shows optimal performance regardless of the amount of training data available. Perplexities and word error rates are reported for two different domains
  • Keywords
    context-free grammars; interactive systems; knowledge based systems; linguistics; speech recognition; artificial corpora; context free grammar; fill-up techniques; initial language models; language understanding grammar; n-gram language models; speech recognition; spoken dialogue systems; task-specific knowledge; word-class; Acoustic applications; Context modeling; Educational technology; Error analysis; Integrated circuit modeling; Natural languages; Speech recognition; Stochastic processes; Stochastic systems; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.674398
  • Filename
    674398