• DocumentCode
    698468
  • Title

    Audio-visual speech recognition with a hybrid SVM-HMM system

  • Author

    Gurban, Mihai ; Thiran, Jean-Philippe

  • Author_Institution
    Signal Process. Inst., Ecole Polytech. Fed. de Lausanne (EPFL), Lausanne, Switzerland
  • fYear
    2005
  • fDate
    4-8 Sept. 2005
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Traditional speech recognition systems use Gaussian mixture models to obtain the likelihoods of individual phonemes, which are then used as state emission probabilities in hidden Markov models representing the words. In hybrid systems, the Gaussian mixtures are replaced by more discriminant classifiers, leading to an improved performance. Most of the time the classifiers used in such systems are neural networks. Support vector machines have also been used in one-modality audio or visual speech recognition, but never in a multimodal audio-visual system. We propose such a hybrid SVM-HMM speech recognizer, and we show how the multimodal approach leads to better performance than that obtained with any of the two modalities individually.
  • Keywords
    audio-visual systems; hidden Markov models; neural nets; signal classification; speech recognition; support vector machines; audio-visual speech recognition; discriminant classifiers; hidden Markov models; hybrid SVM-HMM system; individual phoneme likelihoods; neural networks; state emission probabilities; support vector machines; Accuracy; Hidden Markov models; Signal to noise ratio; Speech; Speech recognition; Support vector machines; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2005 13th European
  • Conference_Location
    Antalya
  • Print_ISBN
    978-160-4238-21-1
  • Type

    conf

  • Filename
    7078053