Title :
An audio-video front-end for multimedia applications
Author :
Zotkin, Dmitry ; Duraiswami, Ramani ; Davis, Larry ; Haritaoglu, Ismail
Author_Institution :
Maryland Univ., College Park, MD, USA
Abstract :
Applications such as video gaming, virtual reality, multimodal user interfaces and videoconferencing require systems that can locate and track persons in a room through a combination of visual and audio cues, enhance the sound that they produce, and perform identification. We describe the development of a particular multimodal sensor fusion system that is portable, runs in real time and achieves these objectives. The system employs novel algorithms for acoustical source location, video-based person tracking and overall system control, which are also described.
Keywords :
computer vision; multimedia systems; real-time systems; sensor fusion; video cameras; acoustical source location; audio cues; audio-video front-end; multimedia applications; multimodal sensor fusion system; multimodal user interfaces; real time; sound; video gaming; video-based person tracking; videoconferencing; virtual reality; visual cues; Acoustic noise; Application software; Cameras; Microphones; Position measurement; Real time systems; Speech recognition; User interfaces; Videoconference; Working environment noise;
Conference_Title :
Systems, Man, and Cybernetics, 2000 IEEE International Conference on
Conference_Location :
Nashville, TN
Print_ISBN :
0-7803-6583-6
DOI :
10.1109/ICSMC.2000.885945