DocumentCode
394319
Title
Language model switching based on topic detection for dialog speech recognition
Author
Lane, Ian R. ; Kawahara, Tatsuya ; Matsui, Tomoko
Author_Institution
Sch. of Informatics, Kyoto Univ., Japan
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
An efficient, scalable speech recognition architecture is proposed for multidomain dialog systems by combining topic detection and topic-dependent language modeling. The inferred domain is automatically detected from the user's utterance, and speech recognition is then performed with an appropriate domain-dependent language model. The architecture improves accuracy and efficiency over current approaches and is scalable to a large number of domains. In this paper, a novel framework using a multilayer hierarchy of language models is introduced in order to improve robustness against topic detection errors. The proposed system provides a relative reduction in WER of 10.5% over a single language model system. Furthermore, it achieves an accuracy that is comparable to using multiple language models in parallel while using only a fraction of the computational cost.
Keywords
interactive systems; natural language interfaces; speech recognition; WER; accuracy; dialog speech recognition; domain-dependent language model; language model switching; multidomain dialog systems; multilayer hierarchy; robustness; scalable speech recognition architecture; topic detection; topic detection errors; topic-dependent language modeling; Computational efficiency; Decoding; Informatics; Laboratories; Natural languages; Robustness; Routing; Speech recognition; Switches; Usability;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198856
Filename
1198856