DocumentCode :
3432877
Title :
Leveraging valence and activation information via multi-task learning for categorical emotion recognition
Author :
Rui Xia ; Yang Liu
Author_Institution :
Comput. Sci. Dept., Univ. of Texas at Dallas, Richardson, TX, USA
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
5301
Lastpage :
5305
Abstract :
Deep learning technologies have been successfully applied to acoustic emotion recognition lately. In this work, we propose to apply multi-task learning for acoustic emotion recognition based on the Deep Belief Network (DBN) framework. We treat the categorical emotion recognition task as the major task. For the secondary task, we leverage two continuous labels, valence and activation. Two strategies are employed to achieve multi-task learning. First, we map the continuous labels into three categorical labels: low; medium; high, and use classification for the secondary task. Second, we project the continuous labels into [-1; 1] range, and use regression for the secondary task. The combination of the loss functions from the major and secondary tasks is used in the objective function in multi-task learning. After iterative optimization, the values from the last hidden layer are used as features in the backend SVM classifier for emotion classification. Our experimental results show significant improvement over the baseline results using DBN, suggesting the benefit of utilizing additional information in a multi-task learning setup.
Keywords :
acoustic signal processing; belief networks; emotion recognition; iterative methods; learning (artificial intelligence); optimisation; regression analysis; signal classification; support vector machines; DBN; acoustic emotion recognition; backend SVM classifier; categorical emotion recognition; categorical label; continuous label; deep belief network; deep learning technology; emotion classification; iterative optimization; loss functions; multitask learning; regression analysis; Computational modeling; Training; Training data; Visualization; Deep Belief Network; Emotion Recognition; Multi-task learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178983
Filename :
7178983
Link To Document :
بازگشت