DocumentCode
3413603
Title
Multi-party focus of attention recognition in meetings from head pose and multimodal contextual cues
Author
Ba, Sileye O. ; Odobez, Jean-Marc
Author_Institution
IDIAP Res. Inst., Lausanne
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
2221
Lastpage
2224
Abstract
This paper presents investigations on visual focus of attention (VFOA) recognition in meetings from audio-visual perceptual cues. Rather than independently recognizing the VFOA of each participant from his own head pose, we propose to recognize participants´ VFOA jointly in order to introduce context dependent interaction models that relates to group activity and the social dynamics of communication. To this end, we designed an input-output hidden Markov model (IOHMM), whose hidden states are the joint VFOA of all participants, and whose main observations are the head poses. Interaction models are introduced in the form of contextual cues that affect the temporal evolution of the joint VFOA sequence, allowing us to model group dynamics that accounts for people´s tendency to share the same focus, or to have their VFOA driven by contextual cues such as slide activity or the participant speaking activity. The model is rigorously evaluated on a publicly available dataset of 4 real meetings of 23min on average, showing an overall 10% relative performance increase w.r.t. the independent recognition case.
Keywords
hidden Markov models; pose estimation; speech recognition; visual perception; attention recognition; audio-visual perceptual cues; head pose; hidden Markov model; multimodal contextual cues; multiparty visual focus; speaking activity; Computer science; Content management; Context modeling; Feedback; Globalization; Government; Hidden Markov models; Information management; Speech recognition; Statistical analysis; Visual focus of attention; contextual cues; head pose; meeting analysis; multi-modal;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518086
Filename
4518086
Link To Document