• DocumentCode
    3484844
  • Title

    Multi-level context-dependent acoustic modeling for automatic speech recognition

  • Author

    Chang, Hung-An ; Glass, James

  • Author_Institution
    MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    89
  • Lastpage
    94
  • Abstract
    In this paper, we propose a multi-level, context-dependent acoustic modeling framework for automatic speech recognition. For each context-dependent unit considered by the recognizer, we construct a set of classifiers that target different amounts of contextual resolution, and then combine them for scoring. Since information from multiple levels of contexts is appropriately combined, the proposed modeling framework provides reasonable scores for units with few or no training examples, while maintaining an ability to distinguish between different context-dependent units. On a large vocabulary lecture transcription task, the proposed modeling framework outperforms a traditional clustering-based context-dependent acoustic model by 3.5% (11.4% relative) in terms of word error rate.
  • Keywords
    hidden Markov models; speech recognition; automatic speech recognition; multilevel context-dependent acoustic modeling; Acoustics; Computational modeling; Context; Context modeling; Data models; Hidden Markov models; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163911
  • Filename
    6163911