• DocumentCode
    542691
  • Title

    A coupled HMM for audio-visual speech recognition

  • Author

    Nefian, Ara V. ; Liang, Luhong ; Pi, Xiaobo ; Xiaoxiang, Liu ; Mao, Crusoe ; Murphy, Kevin

  • Author_Institution
    Microcomputer Research Labs, Intel Corporation, Santa Clara, CA, 95052, USA
  • Volume
    2
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    In recent years several speech recognition systems that use visual together with audio information showed significant increase in performance over the standard speech recognition systems. The use of visual features is justified by both the bimodality of the speech generation and by the need of features that are invariant to acoustic noise perturbation. The audio-visual speech recognition system presented in this paper introduces a novel audio-visual fusion technique that uses a coupled hidden Markov model (HMM). The statistical properties of the coupled-HMM allow us to model the state asynchrony of the audio and visual observations sequences while still preserving their natural correlation over time. The experimental results show that the coupled HMM outperforms the multistream HMM in audio visual speech recognition.
  • Keywords
    Hidden Markov models; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5745027
  • Filename
    5745027