Title :
Initial language models for spoken dialogue systems
Author :
Kellner, Andreas
Author_Institution :
Philips GmbH Forschungslab., Aachen, Germany
Abstract :
The estimation of initial language models for new applications of spoken dialogue systems without large task specific training corpora is becoming an increasingly important issue. This paper investigates two different approaches in which the task-specific knowledge contained in the language understanding grammar is exploited in order to generate n-gram language models for the speech recognizer: The first uses class-based language models for which the word-classes are automatically derived from the grammar. In the second approach, language models are estimated on artificial corpora which have been created from the understanding grammar. The application of fill-up techniques allows the combination of the strengths of both approaches and leads to a language model which shows optimal performance regardless of the amount of training data available. Perplexities and word error rates are reported for two different domains
Keywords :
context-free grammars; interactive systems; knowledge based systems; linguistics; speech recognition; artificial corpora; context free grammar; fill-up techniques; initial language models; language understanding grammar; n-gram language models; speech recognition; spoken dialogue systems; task-specific knowledge; word-class; Acoustic applications; Context modeling; Educational technology; Error analysis; Integrated circuit modeling; Natural languages; Speech recognition; Stochastic processes; Stochastic systems; Training data;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.674398