مرکز منطقه ای اطلاع رساني علوم و فناوري - Lipreading Recognition Based on SVM and DTAK

DocumentCode :

3137102

Title :

Lipreading Recognition Based on SVM and DTAK

Author :

He Jun ; Zhang Hua

Author_Institution :

Jiangxi Key Lab. of Robot & Welding, NanChang Univ., Nanchang, China

fYear :

2010

fDate :

18-20 June 2010

Firstpage :

Lastpage :

Abstract :

To enhance recognition accuracy of isolated words identification with small samples in lipreading, SVM is first introduced to act as classifier in this paper. As SVM is based on structural risk minimization, it solves the problem of pattern recognition under small samples, on the other hand, it avoids the unreasonable hypothesis in traditional classifier. To meet the requirement of fixed input feature dimensionality in SVM, several input feature dimensionality normalization methods were discussed and compared. including 3-4-3 data segmenting method, HMM based method and DTAK(Dynamic Time Alignment Kernel) based method. Two experiments were performed on the bimodal database, In the first experiment different input feature normalization algorithm were compared on SVM. Experiments showed that DTAK based normalization achieved the best result. in the second experiments SVM was compared with HMM under different number samples occasion. Experimental results showed that SVM performs better than HMM under small samples.

Keywords :

image classification; image recognition; image segmentation; support vector machines; DTAK; SVM; classifier; dynamic time alignment kernel; input feature normalization algorithm; lipreading recognition; structural risk minimization; Artificial neural networks; Educational institutions; Helium; Hidden Markov models; Laboratories; Neural networks; Spatial databases; Speech; Support vector machine classification; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on

Conference_Location :

Chengdu

ISSN :

2151-7614

Print_ISBN :

978-1-4244-4712-1

Electronic_ISBN :

2151-7614

Type :

conf

DOI :

10.1109/ICBBE.2010.5517293

Filename :

5517293

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3137102