Title :
Robust Canonical Correlation Analysis: Audio-visual fusion for learning continuous interest
Author :
Nicolaou, Mihalis A. ; Panagakis, Yannis ; Zafeiriou, Stefanos ; Pantic, Maja
Author_Institution :
Dept. of Comput., Imperial Coll. London, London, UK
Abstract :
The problem of automatically estimating the interest level of a subject has been gaining attention by researchers, mostly due to the vast applicability of interest detection. In this work, we obtain a set of continuous interest annotations for the SE-MAINE database, which we analyse also in terms of emotion dimensions such as valence and arousal. Most importantly, we propose a robust variant of Canonical Correlation Analysis (RCCA) for performing audio-visual fusion, which we apply to the prediction of interest. RCCA recovers a low-rank subspace which captures the correlations of fused modalities, while isolating gross errors in the data without making any assumptions regarding Gaussianity. We experimentally show that RCCA is more appropriate than other standard fusion techniques (such as l2-CCA and feature-level fusion), since it both captures interactions between modalities while also decontaminating the obtained subspace from errors which are dominant in real-world problems.
Keywords :
audio signal processing; correlation methods; emotion recognition; speech processing; video signal processing; SE-MAINE database; audio-visual fusion; continuous interest annotation; continuous interest learning; emotion dimension; interest level estimation; interest prediction; robust canonical correlation analysis; Affective computing; Computational modeling; Conferences; Correlation; Databases; Psychology; Robustness; Audio-visual Fusion; Emotion Recognition; Interest Detection; Multi-modal Fusion;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6853852