• DocumentCode
    3349342
  • Title
    Bimodal information analysis for emotion recognition
  • Author
    Meghjani, Malika ; Ferrie, Frank ; Dudek, Gregory
  • Author_Institution
    Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
  • fYear
    2009
  • fDate
    7-8 Dec. 2009
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    We present a bimodal information analysis system for automatic emotion recognition. Our approach is based on the analysis of video sequences and combines facial expressions observed visually with acoustic features to automatically recognize five universal emotion classes: anger, disgust, happiness, sadness and surprise. We address the challenges posed during the temporal analysis of the bimodal data and introduce a novel technique for combining the best features of instantaneous and temporal visual recognition systems. We obtain robust appearance-based visual features, which we classify instantaneously and then aggregate temporally to improve recognition rates compared with single-frame instantaneous classification. The performance of the system is further boosted by using the complementary audio information for bimodal emotion recognition. We combine the two modalities at both the feature level and the score level and compare the respective joint emotion recognition rates. The emotions are instantaneously classified using a support vector machine and sequentially aggregated based on their classification probabilities. This approach is validated on a posed audio-visual database and a natural interactive database. The experiments performed on these databases provide encouraging results, with the best combined recognition rate being 82%.
  • Keywords
    acoustic signal processing; audio databases; emotion recognition; face recognition; image classification; image sequences; probability; support vector machines; video signal processing; visual databases; acoustic features; audio-visual database; automatic emotion recognition; bimodal data; bimodal emotion recognition; bimodal information analysis; classification probability; complementary audio information; facial expressions; instantaneous classification; interactive database; recognition rates; support vector machine; temporal analysis; universal emotion classes; video sequences; visual recognition systems; Aggregates; Audio databases; Emotion recognition; Face recognition; Information analysis; Robustness; Spatial databases; Support vector machine classification; Support vector machines; Video sequences;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Applications of Computer Vision (WACV), 2009 Workshop on
  • Conference_Location
    Snowbird, UT
  • ISSN
    1550-5790
  • Print_ISBN
    978-1-4244-5497-6
  • Type
    conf
  • DOI
    10.1109/WACV.2009.5403035
  • Filename
    5403035
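
The abstract describes classifying each frame instantaneously, aggregating the per-frame classification probabilities over the sequence, and fusing the audio and visual modalities at score level. The record does not give the exact aggregation or fusion rule, so the following is only a minimal sketch under stated assumptions: per-frame probabilities are averaged, and score-level fusion is a weighted sum. The function names, the `visual_weight` parameter, and the sample numbers are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only; the averaging rule and weighted-sum fusion are
# assumptions, not the method published in the paper.

EMOTIONS = ["anger", "disgust", "happiness", "sadness", "surprise"]


def aggregate_frames(frame_probs):
    """Average per-frame class probabilities over one video sequence."""
    n = len(frame_probs)
    if n == 0:
        raise ValueError("need at least one frame")
    k = len(frame_probs[0])
    return [sum(p[i] for p in frame_probs) / n for i in range(k)]


def fuse_scores(visual, audio, visual_weight=0.5):
    """Score-level fusion as a weighted sum, renormalized to sum to 1."""
    fused = [visual_weight * v + (1.0 - visual_weight) * a
             for v, a in zip(visual, audio)]
    total = sum(fused)
    return [f / total for f in fused]


def predict(probs):
    """Return the emotion label with the highest probability."""
    best = max(range(len(probs)), key=probs.__getitem__)
    return EMOTIONS[best]


# Hypothetical per-frame probabilities, e.g. from a probabilistic classifier.
frames = [
    [0.5, 0.1, 0.2, 0.1, 0.1],
    [0.2, 0.1, 0.5, 0.1, 0.1],
    [0.6, 0.1, 0.1, 0.1, 0.1],
]
audio = [0.1, 0.1, 0.6, 0.1, 0.1]  # hypothetical audio-only probabilities

visual = aggregate_frames(frames)
print(predict(visual))                      # visual-only decision
print(predict(fuse_scores(visual, audio)))  # bimodal decision
```

With these made-up numbers the visual-only decision is "anger", while the fused decision shifts to "happiness" because the audio scores favor it. Feature-level fusion, the other strategy the abstract mentions, would instead concatenate audio and visual feature vectors before classification.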