DocumentCode
698468
Title
Audio-visual speech recognition with a hybrid SVM-HMM system
Author
Gurban, Mihai ; Thiran, Jean-Philippe
Author_Institution
Signal Process. Inst., Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland
fYear
2005
fDate
4-8 Sept. 2005
Firstpage
1
Lastpage
4
Abstract
Traditional speech recognition systems use Gaussian mixture models to obtain the likelihoods of individual phonemes, which are then used as state emission probabilities in hidden Markov models representing the words. In hybrid systems, the Gaussian mixtures are replaced by more discriminant classifiers, leading to an improved performance. Most of the time the classifiers used in such systems are neural networks. Support vector machines have also been used in one-modality audio or visual speech recognition, but never in a multimodal audio-visual system. We propose such a hybrid SVM-HMM speech recognizer, and we show how the multimodal approach leads to better performance than that obtained with any of the two modalities individually.
Keywords
audio-visual systems; hidden Markov models; neural nets; signal classification; speech recognition; support vector machines; audio-visual speech recognition; discriminant classifiers; hidden Markov models; hybrid SVM-HMM system; individual phoneme likelihoods; neural networks; state emission probabilities; support vector machines; Accuracy; Hidden Markov models; Signal to noise ratio; Speech; Speech recognition; Support vector machines; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2005 13th European
Conference_Location
Antalya
Print_ISBN
978-160-4238-21-1
Type
conf
Filename
7078053
Link To Document