Title :
Data Fusion for Geometrical and Pixel Based Lip Feature
Author :
Wang Mengjun ; Li Gang
Author_Institution :
Sch. of Inf. Eng., HeBei Univ. of Technol., Tianjin, China
Abstract :
Lipreading is applied to synthesize speech for the speech-impaired people. To get a higher recognition result, data fusion with weighting coefficients at feature level is used to integrate the lip information from different kinds of lip features. Experiments are carried out based on HMM with different states and Gaussian mixture component in a small database for speaker-dependent case. Experiment results showed that the integrated discriminate vector after feature fusion obtains the information from the Geometrical feature vector of lip region and the DCT coefficients of lip´ ROI. With best weighting coefficients m: n=1.5:1, the recognition rate are improved by as much as 5.02% and 8.37%, respectively.
Keywords :
Gaussian processes; discrete cosine transforms; feature extraction; handicapped aids; hidden Markov models; sensor fusion; speech synthesis; DCT coefficients; Gaussian mixture component; HMM; data fusion; geometrical based lip feature; integrated discriminate vector; lipreading; pixel based lip feature; speech synthesis; speech-impaired people; Discrete cosine transforms; Feature extraction; Hidden Markov models; Image sequences; Pixel; Speech; Visualization; Hidden Markov Model; data fusion; geometrical based feature vector; pixel based feature vector; weighting combination;
Conference_Titel :
Intelligence Information Processing and Trusted Computing (IPTC), 2010 International Symposium on
Conference_Location :
Huanggang
Print_ISBN :
978-1-4244-8148-4
Electronic_ISBN :
978-0-7695-4196-9
DOI :
10.1109/IPTC.2010.35