DocumentCode
2964430
Title
Support vector machines for noise robust ASR
Author
Gales, M.J.F. ; Ragni, A. ; AlDamarki, H. ; Gautier, C.
Author_Institution
Eng. Dept., Cambridge Univ., Cambridge, UK
fYear
2009
fDate
Nov. 13 2009-Dec. 17 2009
Firstpage
205
Lastpage
210
Abstract
Using discriminative classifiers, such as Support Vector Machines (SVMs) in combination with, or as an alternative to, Hidden Markov Models (HMMs) has a number of advantages for difficult speech recognition tasks. For example, the models can make use of additional dependencies in the observation sequences than HMMs provided the appropriate form of kernel is used. However standard SVMs are binary classifiers, and speech is a multi-class problem. Furthermore, to train SVMs to distinguish word pairs requires that each word appears in the training data. This paper examines both of these limitations. Tree-based reduction approaches for multiclass classification are described, as well as some of the issues in applying them to dynamic data, such as speech. To address the training data issues, a simplified version of HMM-based synthesis can be used, which allows data for any word-pair to be generated. These approaches are evaluated on two noise corrupted digit sequence tasks: AURORA 2.0; and actual in-car collected data.
Keywords
hidden Markov models; noise; pattern classification; speech recognition; support vector machines; trees (mathematics); word processing; AURORA 2.0; binary classifiers; hidden Markov models; noise corrupted digit sequence tasks; noise robust ASR; speech recognition; support vector machines; tree based reduction approaches; Automatic speech recognition; Classification tree analysis; Hidden Markov models; Kernel; Noise robustness; Speech recognition; Speech synthesis; Support vector machine classification; Support vector machines; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location
Merano
Print_ISBN
978-1-4244-5478-5
Electronic_ISBN
978-1-4244-5479-2
Type
conf
DOI
10.1109/ASRU.2009.5372913
Filename
5372913
Link To Document