DocumentCode
1290644
Title
Variable n-grams and extensions for conversational speech language modeling
Author
Siu, Manhung ; Ostendorf, Mari
Author_Institution
Hong Kong Univ. of Sci. & Technol., Kowloon, Hong Kong
Volume
8
Issue
1
fYear
2000
fDate
1/1/2000 12:00:00 AM
Firstpage
63
Lastpage
75
Abstract
Recent progress in variable n-gram language modeling provides an efficient representation of n-gram models and makes training of higher order n grams possible. We apply the variable n-gram design algorithm to conversational speech, extending the algorithm to learn skips and context-dependent classes to handle conversational speech characteristics such as filler words, repetitions, and other disfluencies. Experiments show that using the extended variable n-gram results in a language model that captures 4-gram context with less than half the parameters of a standard trigram while also improving the test perplexity and recognition accuracy
Keywords
computational linguistics; natural languages; speech recognition; context-dependent classes; conversational speech language modeling; experiments; speech recognition; training; trigram; variable n-gram language modeling; Algorithm design and analysis; Context modeling; Costs; Helium; History; Natural languages; Parameter estimation; Speech recognition; Testing; Vocabulary;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.817454
Filename
817454
Link To Document