Title :
Bimodal information analysis for emotion recognition
Author :
Meghjani, Malika ; Ferrie, Frank ; Dudek, Gregory
Author_Institution :
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
Abstract :
We present a bimodal information analysis system for automatic emotion recognition. Our approach is based on the analysis of video sequences which combines facial expressions observed visually with acoustic features to automatically recognize five universal emotion classes: anger, disgust, happiness, sadness and surprise. We address the challenges posed during the temporal analysis of the bimodal data and introduce a novel technique for combining the best features of instantaneous and temporal based visual recognition systems. We obtain robust appearance-based visual features which we classify instantaneously and aggregate it temporally to improve the recognition rates when compared to single-frame based instantaneous classification. The performance of the system is further boosted by using the complementary audio information for the bimodal emotion recognition. We combine the two modalities at both feature and score level to compare the respective joint emotion recognition rates. The emotions are instantaneously classified using a support vector machine and sequentially aggregated based on their classification probabilities. This approach is validated on a posed audio-visual database and a natural interactive database. The experiments performed on these databases provide encouraging results with the best combined recognition rate being 82%.
Keywords :
acoustic signal processing; audio databases; emotion recognition; face recognition; image classification; image sequences; probability; support vector machines; video signal processing; visual databases; acoustic features; audio-visual database; automatic emotion recognition; bimodal data; bimodal emotion recognition; bimodal information analysis; classification probability; complementary audio information; facial expressions; instantaneous classification; interactive database; recognition rates; support vector machine; temporal analysis; universal emotion classes; video sequences; visual recognition systems; Aggregates; Audio databases; Emotion recognition; Face recognition; Information analysis; Robustness; Spatial databases; Support vector machine classification; Support vector machines; Video sequences;
Conference_Titel :
Applications of Computer Vision (WACV), 2009 Workshop on
Conference_Location :
Snowbird, UT
Print_ISBN :
978-1-4244-5497-6
DOI :
10.1109/WACV.2009.5403035