Title :
Language model acquisition from a text corpus for speech understanding
Author :
Matsuoka, Tatsuo ; Hasson, R. ; Barlow, Michael ; Furui, Sadaoki
Author_Institution :
NTT Human Interface Labs., Tokyo, Japan
Abstract :
Speech understanding can be viewed as a problem of translating the input natural language of speech recognition results into an output semantic language. This paper describes automatic acquisition of a language model for translating natural language into semantic language from a text corpus using a stochastic method. The method estimates the co-occurrence probabilities of input and output grammar rules as a translation language model. Since the amount of text is limited, estimating a reliable language model is difficult. Therefore, we propose a method of concisely modeling input and output grammars in order to estimate a reliable translation model. Experiments on the ARPA ATIS task show that the method is effective.
Keywords :
grammars; language translation; natural languages; probability; semantic networks; speech recognition; stochastic processes; ARPA ATIS task; automatic acquisition; cooccurrence probabilities estimation; experiments; input grammar rules; language model acquisition; output grammar rules; output semantic language; semantic language; speech recognition results; speech understanding; stochastic method; text corpus; Context modeling; Databases; Error analysis; Humans; Laboratories; Speech processing; Writing
Conference_Title :
1996 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-96), Conference Proceedings
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.541120