Title :
Audiovisual Behavior Modeling by Combined Feature Spaces
Author :
Schuller, Björn ; Arsic, D. ; Rigoll, Gerhard ; Wimmer, Manuel ; Radig, Bernd
Author_Institution :
Inst. for Human-Machine Commun., Tech. Univ. München, Germany
Abstract :
Behavior modeling has recently attracted great interest, especially for public surveillance tasks. It is generally agreed that using several input cues, such as audio and video, is beneficial. Yet synchronization and fusion of these information sources remain the main challenge. We therefore show results for a feature space combination, which allows for overall feature space optimization. Audio and video features are first derived as low-level descriptors. Synchronization and feature combination are achieved by multivariate time-series analysis. Test runs on a database of aggressive, cheerful, intoxicated, nervous, neutral, and tired behavior in an airplane situation show a significant improvement over each single modality.
Keywords :
audio-visual systems; emotion recognition; time series; audiovisual behavior modeling; behavior modeling; feature space optimization; information fusion; low-level-descriptors; multivariate time-series analysis; public surveillance tasks; Airplanes; Emotion recognition; Informatics; Man machine systems; Performance analysis; Spatial databases; Speech analysis; Statistical analysis; Surveillance; Time series analysis; Affective Computing; Audiovisual Emotion Recognition; Feature Fusion; Synergistic Multimodality
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366340