DocumentCode :
2964430
Title :
Support vector machines for noise robust ASR
Author :
Gales, M.J.F. ; Ragni, A. ; AlDamarki, H. ; Gautier, C.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge, UK
fYear :
2009
fDate :
Nov. 13 2009-Dec. 17 2009
Firstpage :
205
Lastpage :
210
Abstract :
Using discriminative classifiers, such as Support Vector Machines (SVMs) in combination with, or as an alternative to, Hidden Markov Models (HMMs) has a number of advantages for difficult speech recognition tasks. For example, the models can make use of additional dependencies in the observation sequences than HMMs provided the appropriate form of kernel is used. However standard SVMs are binary classifiers, and speech is a multi-class problem. Furthermore, to train SVMs to distinguish word pairs requires that each word appears in the training data. This paper examines both of these limitations. Tree-based reduction approaches for multiclass classification are described, as well as some of the issues in applying them to dynamic data, such as speech. To address the training data issues, a simplified version of HMM-based synthesis can be used, which allows data for any word-pair to be generated. These approaches are evaluated on two noise corrupted digit sequence tasks: AURORA 2.0; and actual in-car collected data.
Keywords :
hidden Markov models; noise; pattern classification; speech recognition; support vector machines; trees (mathematics); word processing; AURORA 2.0; binary classifiers; hidden Markov models; noise corrupted digit sequence tasks; noise robust ASR; speech recognition; support vector machines; tree based reduction approaches; Automatic speech recognition; Classification tree analysis; Hidden Markov models; Kernel; Noise robustness; Speech recognition; Speech synthesis; Support vector machine classification; Support vector machines; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
Type :
conf
DOI :
10.1109/ASRU.2009.5372913
Filename :
5372913
Link To Document :
بازگشت