Title :
Support vector machines for noise robust ASR
Author :
Gales, M.J.F. ; Ragni, A. ; AlDamarki, H. ; Gautier, C.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge, UK
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
Using discriminative classifiers, such as Support Vector Machines (SVMs) in combination with, or as an alternative to, Hidden Markov Models (HMMs) has a number of advantages for difficult speech recognition tasks. For example, the models can make use of additional dependencies in the observation sequences than HMMs provided the appropriate form of kernel is used. However standard SVMs are binary classifiers, and speech is a multi-class problem. Furthermore, to train SVMs to distinguish word pairs requires that each word appears in the training data. This paper examines both of these limitations. Tree-based reduction approaches for multiclass classification are described, as well as some of the issues in applying them to dynamic data, such as speech. To address the training data issues, a simplified version of HMM-based synthesis can be used, which allows data for any word-pair to be generated. These approaches are evaluated on two noise corrupted digit sequence tasks: AURORA 2.0; and actual in-car collected data.
Keywords :
hidden Markov models; noise; pattern classification; speech recognition; support vector machines; trees (mathematics); word processing; AURORA 2.0; binary classifiers; hidden Markov models; noise corrupted digit sequence tasks; noise robust ASR; speech recognition; support vector machines; tree based reduction approaches; Automatic speech recognition; Classification tree analysis; Hidden Markov models; Kernel; Noise robustness; Speech recognition; Speech synthesis; Support vector machine classification; Support vector machines; Training data;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5372913