DocumentCode :
3245905
Title :
Out-of-vocabulary word recognition with a hierarchical doubly Markov language model
Author :
Kokubo, Hiroaki ; Yamamoto, Hiroshi ; Ogawa, Yoshihiko ; Sagisaka, Yoshinori ; Kikui, Genichiro
Author_Institution :
ATR Spoken Language Translation Res. Labs, Kyoto, Japan
fYear :
2003
fDate :
30 Nov.-3 Dec. 2003
Firstpage :
543
Lastpage :
547
Abstract :
We describe a novel language model for task-dependent out-of-vocabulary (OOV) words. OOV words, such as personal names and place names in a new task, can make language model adaptation difficult. To cope with this problem, we propose a hierarchical, 2-layered language model consisting of inter-word constraints and intra-word constraints. Stochastic properties of OOV words in the two constraints are represented by multi-class modeling and trained as independent Markov models. Occurrence probabilities of an OOV word are expressed by statistics of two Markov models (namely, doubly Markov model). The proposed model has been tested in a Japanese conversational speech database of appointment making. The correct word rate was improved by 7.5% from 78.2% to 86.7% when the new language model was used to recognize sentences with OOV words.
Keywords :
Markov processes; learning (artificial intelligence); natural languages; speech recognition; Japanese conversational speech database; hierarchical doubly Markov language model; independent Markov models; inter-word constraints; intra-word constraints; out-of-vocabulary word recognition; Adaptation model; Databases; Natural languages; Probability; Speech; Statistics; Stochastic processes; Testing; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
Type :
conf
DOI :
10.1109/ASRU.2003.1318498
Filename :
1318498
Link To Document :
بازگشت