• DocumentCode
    3413603
  • Title

    Multi-party focus of attention recognition in meetings from head pose and multimodal contextual cues

  • Author

    Ba, Sileye O. ; Odobez, Jean-Marc

  • Author_Institution
    IDIAP Res. Inst., Lausanne
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    2221
  • Lastpage
    2224
  • Abstract
    This paper presents investigations on visual focus of attention (VFOA) recognition in meetings from audio-visual perceptual cues. Rather than independently recognizing the VFOA of each participant from his own head pose, we propose to recognize participants´ VFOA jointly in order to introduce context dependent interaction models that relates to group activity and the social dynamics of communication. To this end, we designed an input-output hidden Markov model (IOHMM), whose hidden states are the joint VFOA of all participants, and whose main observations are the head poses. Interaction models are introduced in the form of contextual cues that affect the temporal evolution of the joint VFOA sequence, allowing us to model group dynamics that accounts for people´s tendency to share the same focus, or to have their VFOA driven by contextual cues such as slide activity or the participant speaking activity. The model is rigorously evaluated on a publicly available dataset of 4 real meetings of 23min on average, showing an overall 10% relative performance increase w.r.t. the independent recognition case.
  • Keywords
    hidden Markov models; pose estimation; speech recognition; visual perception; attention recognition; audio-visual perceptual cues; head pose; hidden Markov model; multimodal contextual cues; multiparty visual focus; speaking activity; Computer science; Content management; Context modeling; Feedback; Globalization; Government; Hidden Markov models; Information management; Speech recognition; Statistical analysis; Visual focus of attention; contextual cues; head pose; meeting analysis; multi-modal;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518086
  • Filename
    4518086