• DocumentCode
    3137102
  • Title

    Lipreading Recognition Based on SVM and DTAK

  • Author

    He Jun ; Zhang Hua

  • Author_Institution
    Jiangxi Key Lab. of Robot & Welding, NanChang Univ., Nanchang, China
  • fYear
    2010
  • fDate
    18-20 June 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    To enhance recognition accuracy of isolated words identification with small samples in lipreading, SVM is first introduced to act as classifier in this paper. As SVM is based on structural risk minimization, it solves the problem of pattern recognition under small samples, on the other hand, it avoids the unreasonable hypothesis in traditional classifier. To meet the requirement of fixed input feature dimensionality in SVM, several input feature dimensionality normalization methods were discussed and compared. including 3-4-3 data segmenting method, HMM based method and DTAK(Dynamic Time Alignment Kernel) based method. Two experiments were performed on the bimodal database, In the first experiment different input feature normalization algorithm were compared on SVM. Experiments showed that DTAK based normalization achieved the best result. in the second experiments SVM was compared with HMM under different number samples occasion. Experimental results showed that SVM performs better than HMM under small samples.
  • Keywords
    image classification; image recognition; image segmentation; support vector machines; DTAK; SVM; classifier; dynamic time alignment kernel; input feature normalization algorithm; lipreading recognition; structural risk minimization; Artificial neural networks; Educational institutions; Helium; Hidden Markov models; Laboratories; Neural networks; Spatial databases; Speech; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
  • Conference_Location
    Chengdu
  • ISSN
    2151-7614
  • Print_ISBN
    978-1-4244-4712-1
  • Electronic_ISBN
    2151-7614
  • Type

    conf

  • DOI
    10.1109/ICBBE.2010.5517293
  • Filename
    5517293