DocumentCode :
959892
Title :
Adaptive language modeling with varied sources to cover new vocabulary items
Author :
Schwarm, Sarah E. ; Bulyko, Ivan ; Ostendorf, Mari
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Washington, Seattle, WA, USA
Volume :
12
Issue :
3
fYear :
2004
fDate :
5/1/2004 12:00:00 AM
Firstpage :
334
Lastpage :
342
Abstract :
N-gram language modeling typically requires large quantities of in-domain training data, i.e., data that matches the task in both topic and style. For conversational speech applications, particularly meeting transcription, obtaining large volumes of speech transcripts is often unrealistic; topics change frequently and collecting conversational-style training data is time-consuming and expensive. In particular, new topics introduce new vocabulary items which are not included in existing models. In this work, we use a variety of data sources (reflecting different sizes and styles), combined using mixture n-gram models. We study the impact of the different sources on vocabulary expansion and recognition accuracy, and investigate possible indicators of the usefulness of a data source. For the task of recognizing meeting speech, we obtain a 9% relative reduction in the overall word error rate and a 61% relative reduction in the word error rate for "new" words added to the vocabulary over a baseline language model trained from general conversational speech data.
Keywords :
natural languages; speech processing; speech recognition; vocabulary; word processing; adaptive language modeling; conversational speech; conversational-style training data; data sources; meeting transcription; mixture models; n-gram language modeling; speech recognition; speech transcripts; text normalization; vocabulary expansion; vocabulary items; vocabulary recognition; word error rate; Automatic speech recognition; Computer science; Ear; Error analysis; Natural languages; Speech recognition; Testing; Training data; Vocabulary; Web sites;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2004.825666
Filename :
1288159
Link To Document :
بازگشت