• DocumentCode
    2963876
  • Title

    Acoustic emotion recognition: A benchmark comparison of performances

  • Author

    Schuller, Björn ; Vlasenko, Bogdan ; Eyben, Florian ; Rigoll, Gerhard ; Wendemuth, Andreas

  • Author_Institution
    Inst. for Human-Machine Commun., Tech. Univ. Munchen, Munich, Germany
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    552
  • Lastpage
    557
  • Abstract
    In the light of the first challenge on emotion recognition from speech we provide the largest-to-date benchmark comparison under equal conditions on nine standard corpora in the field using the two pre-dominant paradigms: modeling on a frame-level by means of hidden Markov models and supra-segmental modeling by systematic feature brute-forcing. Investigated corpora are the ABC, AVIC, DES, EMO-DB, eNTERFACE, SAL, SmartKom, SUSAS, and VAM databases. To provide better comparability among sets, we additionally cluster each database´s emotions into binary valence and arousal discrimination tasks. In the result large differences are found among corpora that mostly stem from naturalistic emotions and spontaneous speech vs. more prototypical events. Further, supra-segmental modeling proves significantly beneficial on average when several classes are addressed at a time.
  • Keywords
    emotion recognition; hidden Markov models; ABC databases; AVIC databases; DES databases; EMO-DB databases; SAL databases; SUSAS databases; SmartKom databases; VAM databases; acoustic emotion recognition; eNTERFACE databases; hidden Markov models; systematic feature brute-forcing; Benchmark testing; Communication standards; Emotion recognition; Hidden Markov models; Man machine systems; Pattern recognition; Prototypes; Spatial databases; Speech analysis; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5372886
  • Filename
    5372886