• DocumentCode
    3484791
  • Title

    Don´t multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition

  • Author

    Gillick, Dan ; Gillick, Larry ; Wegmann, Steven

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    71
  • Lastpage
    76
  • Abstract
    We describe a series of experiments simulating data from the standard Hidden Markov Model (HMM) framework used for speech recognition. Starting with a set of test transcriptions, we begin by simulating every step of the generative process. In each subsequent experiment, we substitute a real component for a simulated component (real state durations rather than simulating from the transition models, for example), and compare the word error rates of the resulting data, thus quantifying the relative costs of each modeling assumption. A novel sampling process allows us to test the independence assumptions of the HMM, which appear to present far more serious problems than the other data/model mismatches.
  • Keywords
    hidden Markov models; speech recognition; HMM; acoustic model assumptions; generative process; real component; real state durations; sampling process; simulated component; speech recognition; standard hidden Markov model framework; test transcriptions; word error rates; Acoustics; Computational modeling; Data models; Hidden Markov models; Speech; Speech recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163908
  • Filename
    6163908