DocumentCode
323764
Title
Two-step generation of variable-word-length language model integrating local and global constraints
Author
Matsunaga, Shoichi ; Sagayama, Shigeki
Author_Institution
NTT Human Interface Labs., Kanagawa, Japan
Volume
2
fYear
1998
fDate
12-15 May 1998
Firstpage
697
Abstract
This paper proposes a two-step method for generating a variable-length class-based language model that integrates local and global constraints. In the first step, an initial class set is recursively designed using local constraints. Word elements for each class are determined using Kullback divergence and total entropy. In the second step, the word classes are recursively recreated and the words iteratively recreated, by grouping consecutive words into longer units and by splitting the initial classes into finer classes. These second-step operations are carried out selectively, taking into account local and global constraints on the basis of a minimum entropy criterion. Experiments showed that the perplexity of the proposed initial class set is superior to that of the conventional part-of-speech classes, and that the perplexity of the variable-word-length model consequently becomes lower. Furthermore, this two-step model generation approach greatly reduces the training time.
Keywords
grammars; iterative methods; minimum entropy methods; natural languages; speech processing; speech recognition; Kullback divergence; class-based language model; experiments; global constraints; iterative word-class; large vocabulary continuous speech recognition; local constraints; minimum entropy criterion; part-of-speech class; perplexity; total entropy; training time reduction; two-step model generation; variable-word-length language model; Broadcasting; Entropy; Gratings; Humans; Natural languages; Power generation; Speech recognition; Testing; Vocabulary
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675360
Filename
675360