DocumentCode
3484791
Title
Don´t multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition
Author
Gillick, Dan ; Gillick, Larry ; Wegmann, Steven
Author_Institution
Int. Comput. Sci. Inst., Berkeley, CA, USA
fYear
2011
fDate
11-15 Dec. 2011
Firstpage
71
Lastpage
76
Abstract
We describe a series of experiments simulating data from the standard Hidden Markov Model (HMM) framework used for speech recognition. Starting with a set of test transcriptions, we begin by simulating every step of the generative process. In each subsequent experiment, we substitute a real component for a simulated component (real state durations rather than simulating from the transition models, for example), and compare the word error rates of the resulting data, thus quantifying the relative costs of each modeling assumption. A novel sampling process allows us to test the independence assumptions of the HMM, which appear to present far more serious problems than the other data/model mismatches.
Keywords
hidden Markov models; speech recognition; HMM; acoustic model assumptions; generative process; real component; real state durations; sampling process; simulated component; speech recognition; standard hidden Markov model framework; test transcriptions; word error rates; Acoustics; Computational modeling; Data models; Hidden Markov models; Speech; Speech recognition; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location
Waikoloa, HI
Print_ISBN
978-1-4673-0365-1
Electronic_ISBN
978-1-4673-0366-8
Type
conf
DOI
10.1109/ASRU.2011.6163908
Filename
6163908
Link To Document