Language model switching based on topic detection for dialog speech recognition

Author

Lane, Ian R. ; Kawahara, Tatsuya ; Matsui, Tomoko

Author_Institution

Sch. of Informatics, Kyoto Univ., Japan

Volume

1

fYear

2003

fDate

6-10 April 2003

Abstract

An efficient, scalable speech recognition architecture is proposed for multidomain dialog systems by combining topic detection and topic-dependent language modeling. The inferred domain is automatically detected from the user´s utterance, and speech recognition is then performed with an appropriate domain-dependent language model. The architecture improves accuracy and efficiency over current approaches and is scaleable to a large number of domains. In this paper, a novel framework using a multilayer hierarchy of language models is introduced in order to improve robustness against topic detection errors. The proposed system provides a relative reduction in WER of 10.5% over a single language model system. Furthermore it achieves an accuracy that is comparable to using multiple language models in parallel while using only a fraction of the computational cost.

Keywords

interactive systems; natural language interfaces; speech recognition; WER; accuracy; dialog speech recognition; domain-dependent language model; language model switching; multidomain dialog systems; multilayer hierarchy; robustness; scalable speech recognition architecture; topic detection; topic detection errors; topic-dependent language modeling; Computational efficiency; Decoding; Informatics; Laboratories; Natural languages; Robustness; Routing; Speech recognition; Switches; Usability;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on

ISSN

1520-6149

Print_ISBN

0-7803-7663-3

Type

conf

DOI

10.1109/ICASSP.2003.1198856

Filename

1198856