DocumentCode
3137102
Title
Lipreading Recognition Based on SVM and DTAK
Author
He Jun ; Zhang Hua
Author_Institution
Jiangxi Key Lab. of Robot & Welding, NanChang Univ., Nanchang, China
fYear
2010
fDate
18-20 June 2010
Firstpage
1
Lastpage
4
Abstract
To enhance recognition accuracy of isolated words identification with small samples in lipreading, SVM is first introduced to act as classifier in this paper. As SVM is based on structural risk minimization, it solves the problem of pattern recognition under small samples, on the other hand, it avoids the unreasonable hypothesis in traditional classifier. To meet the requirement of fixed input feature dimensionality in SVM, several input feature dimensionality normalization methods were discussed and compared. including 3-4-3 data segmenting method, HMM based method and DTAK(Dynamic Time Alignment Kernel) based method. Two experiments were performed on the bimodal database, In the first experiment different input feature normalization algorithm were compared on SVM. Experiments showed that DTAK based normalization achieved the best result. in the second experiments SVM was compared with HMM under different number samples occasion. Experimental results showed that SVM performs better than HMM under small samples.
Keywords
image classification; image recognition; image segmentation; support vector machines; DTAK; SVM; classifier; dynamic time alignment kernel; input feature normalization algorithm; lipreading recognition; structural risk minimization; Artificial neural networks; Educational institutions; Helium; Hidden Markov models; Laboratories; Neural networks; Spatial databases; Speech; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location
Chengdu
ISSN
2151-7614
Print_ISBN
978-1-4244-4712-1
Electronic_ISBN
2151-7614
Type
conf
DOI
10.1109/ICBBE.2010.5517293
Filename
5517293
Link To Document