• DocumentCode
    2964430
  • Title

    Support vector machines for noise robust ASR

  • Author

    Gales, M.J.F. ; Ragni, A. ; AlDamarki, H. ; Gautier, C.

  • Author_Institution
    Eng. Dept., Cambridge Univ., Cambridge, UK
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    205
  • Lastpage
    210
  • Abstract
    Using discriminative classifiers, such as Support Vector Machines (SVMs) in combination with, or as an alternative to, Hidden Markov Models (HMMs) has a number of advantages for difficult speech recognition tasks. For example, the models can make use of additional dependencies in the observation sequences than HMMs provided the appropriate form of kernel is used. However standard SVMs are binary classifiers, and speech is a multi-class problem. Furthermore, to train SVMs to distinguish word pairs requires that each word appears in the training data. This paper examines both of these limitations. Tree-based reduction approaches for multiclass classification are described, as well as some of the issues in applying them to dynamic data, such as speech. To address the training data issues, a simplified version of HMM-based synthesis can be used, which allows data for any word-pair to be generated. These approaches are evaluated on two noise corrupted digit sequence tasks: AURORA 2.0; and actual in-car collected data.
  • Keywords
    hidden Markov models; noise; pattern classification; speech recognition; support vector machines; trees (mathematics); word processing; AURORA 2.0; binary classifiers; hidden Markov models; noise corrupted digit sequence tasks; noise robust ASR; speech recognition; support vector machines; tree based reduction approaches; Automatic speech recognition; Classification tree analysis; Hidden Markov models; Kernel; Noise robustness; Speech recognition; Speech synthesis; Support vector machine classification; Support vector machines; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5372913
  • Filename
    5372913