DocumentCode
3484844
Title
Multi-level context-dependent acoustic modeling for automatic speech recognition
Author
Chang, Hung-An ; Glass, James
Author_Institution
MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
fYear
2011
fDate
11-15 Dec. 2011
Firstpage
89
Lastpage
94
Abstract
In this paper, we propose a multi-level, context-dependent acoustic modeling framework for automatic speech recognition. For each context-dependent unit considered by the recognizer, we construct a set of classifiers that target different amounts of contextual resolution, and then combine them for scoring. Since information from multiple levels of contexts is appropriately combined, the proposed modeling framework provides reasonable scores for units with few or no training examples, while maintaining an ability to distinguish between different context-dependent units. On a large vocabulary lecture transcription task, the proposed modeling framework outperforms a traditional clustering-based context-dependent acoustic model by 3.5% (11.4% relative) in terms of word error rate.
Keywords
hidden Markov models; speech recognition; automatic speech recognition; multilevel context-dependent acoustic modeling; Acoustics; Computational modeling; Context; Context modeling; Data models; Hidden Markov models; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location
Waikoloa, HI
Print_ISBN
978-1-4673-0365-1
Electronic_ISBN
978-1-4673-0366-8
Type
conf
DOI
10.1109/ASRU.2011.6163911
Filename
6163911
Link To Document