• DocumentCode
    3638071
  • Title

    Multi-modal Emotion Recognition Using Canonical Correlations and Acoustic Features

  • Author

    Rok Gajsek;Vitomir Štruc;France Mihelic

  • Author_Institution
    Fac. of Electr. Eng., Univ. of Ljubljana, Ljubljana, Slovenia
  • fYear
    2010
  • Firstpage
    4133
  • Lastpage
    4136
  • Abstract
    The information of the psycho-physical state of the subject is becoming a valuable addition to the modern audio or video recognition systems. As well as enabling a better user experience, it can also assist in superior recognition accuracy of the base system. In the article, we present our approach to multi-modal (audio-video) emotion recognition system. For audio sub-system, a feature set comprised of prosodic, spectral and cepstrum features is selected and support vector classifier is used to produce the scores for each emotional category. For video sub-system a novel approach is presented, which does not rely on the tracking of specific facial landmarks and thus, eliminates the problems usually caused, if the tracking algorithm fails at detecting the correct area. The system is evaluated on the interface database and the recognition accuracy of our audio-video fusion is compared to the published results in the literature.
  • Keywords
    "Emotion recognition","Video sequences","Correlation","Feature extraction","Databases","Face","Support vector machines"
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.1005
  • Filename
    5597732