Title :
String-based audiovisual fusion of behavioural events for the assessment of dimensional affect
Author :
Eyben, Florian ; Wöllmer, Martin ; Valstar, Michel F. ; Gunes, Hatice ; Schuller, Björn ; Pantic, Maja
Author_Institution :
Inst. for Human-Machine Commun., Tech. Univ. Munchen, Munich, Germany
Abstract :
The automatic assessment of affect is mostly based on feature-level approaches, such as distances between facial points or prosodic and spectral information when it comes to audiovisual analysis. However, it is known and intuitive that behavioural events such as smiles, head shakes or laughter and sighs also bear highly relevant information regarding a subject´s affective display. Accordingly, we propose a novel string-based prediction approach to fuse such events and to predict human affect in a continuous dimensional space. Extensive analysis and evaluation has been conducted using the newly released SEMAINE database of human-to-agent communication. For a thorough understanding of the obtained results, we provide additional benchmarks by more conventional feature-level modelling, and compare these and the string-based approach to fusion of signal-based features and string-based events. Our experimental results show that the proposed string-based approach is the best performing approach for automatic prediction of Valence and Expectation dimensions, and improves prediction performance for the other dimensions when combined with at least acoustic signal-based features.
Keywords :
audio databases; audio signal processing; audio-visual systems; behavioural sciences; feature extraction; video signal processing; visual databases; SEMAINE database; Valence dimension; behavioural event; continuous dimensional space; expectation dimension; facial point; feature level approach; feature level modelling; human to agent communication; signal based feature; spectral information; string based audiovisual fusion; string based event; string based prediction approach; Databases; Face; Feature extraction; Hidden Markov models; Pixel; Speech; Visualization;
Conference_Titel :
Automatic Face & Gesture Recognition and Workshops (FG 2011), 2011 IEEE International Conference on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
978-1-4244-9140-7
DOI :
10.1109/FG.2011.5771417