مرکز منطقه ای اطلاع رساني علوم و فناوري - Extraction of visual features for lipreading

DocumentCode :

1560327

Title :

Extraction of visual features for lipreading

Author :

Matthews, Iain ; Cootes, Timothy F. ; Bangham, J. Andrew ; Cox, Stephen ; Harvey, Richard

Author_Institution :

Robotics Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume :

Issue :

fYear :

2002

fDate :

2/1/2002 12:00:00 AM

Firstpage :

198

Lastpage :

213

Abstract :

The multimodal nature of speech is often ignored in human-computer interaction, but lip deformations and other body motion, such as those of the head, convey additional information. We integrate speech cues from many sources and this improves intelligibility, especially when the acoustic signal is degraded. The paper shows how this additional, often complementary, visual speech information can be used for speech recognition. Three methods for parameterizing lip image sequences for recognition using hidden Markov models are compared. Two of these are top-down approaches that fit a model of the inner and outer lip contours and derive lipreading features from a principal component analysis of shape or shape and appearance, respectively. The third, bottom-up, method uses a nonlinear scale-space analysis to form features directly from the pixel intensity. All methods are compared on a multitalker visual speech recognition task of isolated letters

Keywords :

computer vision; feature extraction; filtering theory; hidden Markov models; image sequences; mathematical morphology; principal component analysis; speech recognition; acoustic signal; active appearance model; bottom-up method; computer vision; connected-set morphology; hidden Markov models; human-computer interaction; intelligibility; isolated letters; lip deformations; lip image sequences; lipreading; multitalker visual speech recognition task; nonlinear scale-space analysis; principal component analysis; speech recognition; top-down approaches; visual features extraction; visual speech information; Data mining; Degradation; Feature extraction; Head; Hidden Markov models; Image recognition; Image sequences; Principal component analysis; Shape; Speech recognition;

fLanguage :

English

Journal_Title :

Pattern Analysis and Machine Intelligence, IEEE Transactions on

Publisher :

ieee

ISSN :

0162-8828

Type :

jour

DOI :

10.1109/34.982900

Filename :

982900

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1560327