• DocumentCode
    1686532
  • Title

    The blame game in meeting room ASR: An analysis of feature versus model errors in noisy and mismatched conditions

  • Author

    Parthasarathi, Sree Hari Krishnan ; Shuo-Yiin Chang ; Cohen, Johanne ; Morgan, Nigel ; Wegmann, Steven

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • fYear
    2013
  • Firstpage
    6758
  • Lastpage
    6762
  • Abstract
    Given a test waveform, state-of-the-art ASR systems extract a sequence of MFCC features and decode them with a set of trained HMMs. When this test data is clean, and it matches the condition used for training the models, then there are few errors. While it is known that ASR systems are brittle in noisy or mismatched conditions, there has been little work in quantitatively attributing the errors to features or to models. This paper attributes the sources of these errors in three conditions: (a) matched near-field, (b) matched far-field, and a (c) mismatched condition. We undertake a series of diagnostic analyses employing the bootstrap method to probe a meeting room ASR system. Results show that when the conditions are matched (even if they are far-field), the model errors dominate; however, in mismatched conditions features are neither invariant nor separable and this causes as many errors as the model does.
  • Keywords
    acoustic signal processing; feature extraction; game theory; hidden Markov models; speech recognition; HMM; MFCC feature extraction; acoustic condition; blame game; bootstrap method; diagnostic analyses; feature errors; hidden Markov model; matched far-field condition; matched near-field condition; meeting room ASR system; mismatched conditions; model errors; noisy conditions; speech recognition; test waveform; Acoustics; Adaptation models; Data models; Hidden Markov models; Noise measurement; Speech recognition; Training; Features; acoustic conditions; hidden Markov models; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6638970
  • Filename
    6638970