DocumentCode :
3484844
Title :
Multi-level context-dependent acoustic modeling for automatic speech recognition
Author :
Chang, Hung-An ; Glass, James
Author_Institution :
MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
fYear :
2011
fDate :
11-15 Dec. 2011
Firstpage :
89
Lastpage :
94
Abstract :
In this paper, we propose a multi-level, context-dependent acoustic modeling framework for automatic speech recognition. For each context-dependent unit considered by the recognizer, we construct a set of classifiers that target different amounts of contextual resolution, and then combine them for scoring. Since information from multiple levels of contexts is appropriately combined, the proposed modeling framework provides reasonable scores for units with few or no training examples, while maintaining an ability to distinguish between different context-dependent units. On a large vocabulary lecture transcription task, the proposed modeling framework outperforms a traditional clustering-based context-dependent acoustic model by 3.5% (11.4% relative) in terms of word error rate.
Keywords :
hidden Markov models; speech recognition; automatic speech recognition; multilevel context-dependent acoustic modeling; Acoustics; Computational modeling; Context; Context modeling; Data models; Hidden Markov models; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
Type :
conf
DOI :
10.1109/ASRU.2011.6163911
Filename :
6163911
Link To Document :
بازگشت