Title :
Adapted language modeling for recognition of retelling story in language learning
Author :
Chen, Meng ; Song, Yang ; Wang, Lan
Author_Institution :
Shenzhen Inst. of Adv. Technol., Univ. of Hong Kong, Hong Kong, China
Abstract :
N-gram language modeling typically requires large quantities of in-domain training data, i.e., data that matches the task in both topic and style. For the task of retelling stories, obtaining large volumes of speech transcriptions is often unrealistic. In this paper, we propose a novel method of language modeling using mixture models with very limited text datain the task of retelling stories. We modeled topic-specific, spoken-style, and document-style language models separately and interpolated them. We also interpolated the class-based language model with the N-gram models. Experimental results show that up to 61.6% reduction of perplexity and 20.7% reduction of word error rate (WER) have been obtained by our best performing model.
Keywords :
computer aided instruction; humanities; interpolation; natural language processing; pattern matching; speech recognition; WER; adapted language modeling; class-based language model; document-style language models; in-domain training data; language learning; mixture models; n-gram language modeling; retelling story recognition; speech transcriptions; spoken-style language models; topic-specific language models; word error rate; Adaptation models; Computational modeling; Data models; Hidden Markov models; Interpolation; Speech; Vocabulary;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
DOI :
10.1109/ICALIP.2012.6376659