DocumentCode :
3280354
Title :
Real-time lip trackers for use in audio-visual speech recognition
Author :
Kaucic, Robert ; Reynard, David ; Blake, Andrew
Author_Institution :
Robotics Res. Group, Oxford Univ., UK
fYear :
1996
fDate :
28 Nov. 1996
Firstpage :
3/1
Lastpage :
3/6
Abstract :
Human speech is inherently multi-modal, consisting of both audio and visual components. The increased computing power of general-purpose workstations and PCs has made it possible to extract visual features in real time that can supplement acoustic-only speech recognisers, enabling robust recognition of speech in the presence of acoustic noise. To achieve real-time performance, previous work has utilised a dynamic contour framework with cosmetically assisted lips. It is shown that unadorned lips can be suitably tracked in real time without cosmetic assistance. In addition, a coupled head-lip tracker is presented which provides accurate, stable lip tracking throughout a range of head positions and poses. As well as improving tracking performance, the coupling of the head tracker to the lip tracker provides an opportunity to extract visual recognition features that are invariant to head position, a must for audio-visual recognition of unconstrained speakers.
Keywords :
real-time systems; acoustic noise; audiovisual speech recognition; cosmetically assisted lips; coupled head-lip tracker; dynamic contour framework; general-purpose workstations; head positions; multi-modal human speech; personal computers; pose; real-time lip trackers; robust recognition; tracking performance; unadorned lips; unconstrained speakers; visual feature extraction;
fLanguage :
English
Publisher :
IET
Conference_Titel :
Integrated Audio-Visual Processing for Recognition, Synthesis and Communication (Digest No: 1996/213), IEE Colloquium on
Conference_Location :
London
Type :
conf
DOI :
10.1049/ic:19961147
Filename :
645680