Title :
Lipreading Recognition Based on SVM and DTAK
Author :
He Jun ; Zhang Hua
Author_Institution :
Jiangxi Key Lab. of Robot & Welding, NanChang Univ., Nanchang, China
Abstract :
To enhance recognition accuracy of isolated words identification with small samples in lipreading, SVM is first introduced to act as classifier in this paper. As SVM is based on structural risk minimization, it solves the problem of pattern recognition under small samples, on the other hand, it avoids the unreasonable hypothesis in traditional classifier. To meet the requirement of fixed input feature dimensionality in SVM, several input feature dimensionality normalization methods were discussed and compared. including 3-4-3 data segmenting method, HMM based method and DTAK(Dynamic Time Alignment Kernel) based method. Two experiments were performed on the bimodal database, In the first experiment different input feature normalization algorithm were compared on SVM. Experiments showed that DTAK based normalization achieved the best result. in the second experiments SVM was compared with HMM under different number samples occasion. Experimental results showed that SVM performs better than HMM under small samples.
Keywords :
image classification; image recognition; image segmentation; support vector machines; DTAK; SVM; classifier; dynamic time alignment kernel; input feature normalization algorithm; lipreading recognition; structural risk minimization; Artificial neural networks; Educational institutions; Helium; Hidden Markov models; Laboratories; Neural networks; Spatial databases; Speech; Support vector machine classification; Support vector machines;
Conference_Titel :
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-4712-1
Electronic_ISBN :
2151-7614
DOI :
10.1109/ICBBE.2010.5517293