Title :
Lip temporal pattern analysis for automatic visual speech recognition
Author :
Xie, Lei ; Cai, Xiuli ; Fu, Zhonghua ; Jiang, Dongmei ; Zhao, Rongchun
Author_Institution :
Sch. of Comput. Sci., Northwestern Polytech. Univ., Xi'an, China
fDate :
31 Aug.-4 Sept. 2004
Abstract :
This paper presents a novel approach to processing temporal lip motion information for dynamic visual feature extraction in visual speech recognition. Long-time lip temporal patterns (LipTRAPs) of visual phonemes are introduced to analyze the nature of lip shape changes during speech. A dynamic visual feature based on the LipTRAPs is also proposed. Visual speech recognition experiments on a connected-digits task show that the LipTRAP feature yields a significantly higher word recognition rate (WRR) than conventional delta features.
Keywords :
feature extraction; speech recognition; automatic visual speech recognition; dynamic visual feature extraction; lip temporal pattern analysis; temporal lip motion information; Acoustic sensors; Computer science; Data mining; Feature extraction; Mouth; Pattern analysis; Shape; Speech analysis; Speech processing; Speech recognition;
Conference_Titel :
Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
Print_ISBN :
0-7803-8406-7
DOI :
10.1109/ICOSP.2004.1452760