• DocumentCode
    3207690
  • Title

    A lip reading method based on 3-D DCT and 3-D HMM

  • Author

    Min, Kim Yong ; Zuo, Li Hong

  • Author_Institution
    Math. Fac., Kim Il Sung Univ., Pyongyang, North Korea
  • Volume
    1
  • fYear
    2011
  • fDate
    29-31 July 2011
  • Abstract
    Lip reading aims to recognize what a person says by analyzing visual speech information such as lip movement. The technique is used to improve speech recognition rates in noisy environments. Many lip feature extraction methods have been developed, but most of them are sensitive to lip positioning. Extracting features with a Hidden Markov Model (HMM) avoids this defect. Because lip movement videos have a 3-D structure, we extend Hidden Markov Models to 3-dimensional space and use the resulting 3-D HMM to construct a lip movement model. To capture the dynamic features of the lip, we extract feature vectors using the 3-D Discrete Cosine Transform (DCT). The lip reading method based on 3-D DCT and 3-D HMM has the following advantages: it takes the dynamic features of lip movement into account, and it is robust to rotation, parallel shift and scale variation. We tested its performance on the VidTIMIT database against a Pseudo 3-D HMM (P3-D HMM) based method. Our method increases the recognition rate by about 2-3% over the P3-D HMM based method.
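The 3-D DCT feature extraction step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the block size, the normalization, or which coefficients are retained, so the orthonormal DCT-II, the `(frames, height, width)` layout, the `keep` parameter, and the function names below are all assumptions chosen for illustration, not the authors' implementation.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis matrix of size n x n (rows are frequencies)."""
    k = np.arange(n).reshape(-1, 1)   # frequency index
    i = np.arange(n).reshape(1, -1)   # sample index
    mat = np.cos(np.pi * (2 * i + 1) * k / (2 * n)) * np.sqrt(2.0 / n)
    mat[0, :] = np.sqrt(1.0 / n)      # DC row uses a different scale factor
    return mat

def dct3(volume):
    """Separable 3-D DCT-II of an array shaped (frames, height, width)."""
    t, h, w = volume.shape
    out = np.einsum("at,thw->ahw", dct_matrix(t), volume)  # along time
    out = np.einsum("bh,ahw->abw", dct_matrix(h), out)     # along rows
    out = np.einsum("cw,abw->abc", dct_matrix(w), out)     # along columns
    return out

def low_frequency_features(volume, keep=(2, 4, 4)):
    """Flatten the lowest-frequency corner of the 3-D DCT into a vector.

    `keep` is a hypothetical choice; the source does not state how many
    coefficients are used per block.
    """
    coeffs = dct3(np.asarray(volume, dtype=float))
    kt, kh, kw = keep
    return coeffs[:kt, :kh, :kw].ravel()

if __name__ == "__main__":
    # Stand-in for a short stack of grayscale lip-region frames.
    rng = np.random.default_rng(0)
    clip = rng.random((4, 8, 8))
    print(low_frequency_features(clip).shape)
```

Because the transform spans the time axis as well as the two image axes, the retained coefficients summarize how the lip region changes across frames, which is the sense in which the abstract says 3-D DCT features capture lip dynamics.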
  • Keywords
    discrete cosine transforms; feature extraction; hidden Markov models; object recognition; speech processing; speech recognition; 3D DCT; VidTIMIT database; discrete cosine transform; hidden Markov models; lip feature extracting methods; lip movement videos; lip reading method; noise environment; parallel shift; pseudo 3D HMM based method; rotation; variant scaling; visual speech information analysis; Analytical models; Authentication; Educational institutions; Face recognition; Hidden Markov models; Solid modeling; 3-D DCT; 3-D HMM; Lip reading;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Electronics and Optoelectronics (ICEOE), 2011 International Conference on
  • Conference_Location
    Dalian
  • Print_ISBN
    978-1-61284-275-2
  • Type

    conf

  • DOI
    10.1109/ICEOE.2011.6013060
  • Filename
    6013060