Title :
Just-in-time multimodal association and fusion from home entertainment
Author :
Korchagin, Danil ; Motlicek, Petr ; Duffner, Stefan ; Bourlard, Hervé
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
Abstract :
In this paper, we describe a real-time multimodal analysis system with just-in-time multimodal association and fusion for a living room environment, where multiple people may enter, interact and leave the observable world with no constraints. It comprises detection and tracking of up to 4 faces, detection and localisation of verbal and paralinguistic events, and their association and fusion. The system is designed for open, unconstrained environments, such as next-generation video conferencing systems that automatically "orchestrate" the transmitted video streams to improve the overall experience of interaction between spatially separated families and friends. Performance levels achieved to date on a hand-labelled dataset have shown sufficient reliability while fulfilling real-time processing requirements.
Keywords :
reliability; teleconferencing; video communication; video streaming; fusion; hand-labelled dataset; home entertainment; just-in-time multimodal association; living room environment; next generation video conferencing; paralinguistic events; real-time multimodal analysis; transmitted video streams; verbal events; Acoustics; Arrays; Microphones; Real time systems; Speech recognition; Streaming media; Target tracking; Multimodal signal processing; association rules; data analysis; sensor fusion
Conference_Title :
Multimedia and Expo (ICME), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-61284-348-3
ISSN :
1945-7871
DOI :
10.1109/ICME.2011.6012242