DocumentCode :
3630209
Title :
Selection of Emotionally Salient Audio-Visual Features for Modeling Human Evaluations of Synthetic Character Emotion Displays
Author :
Emily Mower;Maja J. Mataric;Shrikanth Narayanan
Author_Institution :
Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA
fYear :
2008
Firstpage :
190
Lastpage :
195
Abstract :
Computer simulated avatars and humanoid robots have an increasingly prominent place in today´s world. Acceptance of these synthetic characters depends on their ability to properly and recognizably convey basic emotion states to a user population. This study presents an analysis of audio-visual features that can be used to predict user evaluations of synthetic character emotion displays. These features include prosodic, spectral, and semantic properties of audio signals in addition to FACS-inspired video features. The goal of this paper is to identify the audio-visual features that explain the variance in the emotional evaluations of naive listeners through the utilization of information gain feature selection in conjunction with support vector machines. These results suggest that there exists an emotionally salient subset of the audio-visual feature space. The features that contribute most to the explanation of evaluator variance are the prior knowledge audio statistics (e.g., average valence rating), the high energy band spectral components, and the quartile pitch range. This feature subset should be correctly modeled and implemented in the design of synthetic expressive displays to convey the desired emotions.
Keywords :
"Humans","Auditory displays","Computer displays","Computational modeling","Computer simulation","Avatars","Humanoid robots","Emotion recognition","Character recognition","Support vector machines"
Publisher :
ieee
Conference_Titel :
Multimedia, 2008. ISM 2008. Tenth IEEE International Symposium on
Type :
conf
DOI :
10.1109/ISM.2008.78
Filename :
4741168
Link To Document :
بازگشت