• DocumentCode
    3488104
  • Title

    Hidden Markov model-based speech emotion recognition

  • Author

    Schuller, Björn ; Rigoll, Gerhard ; Lang, Manfred

  • Author_Institution
    Inst. for Human-Comput. Commun., Technische Univ. Munchen, Germany
  • Volume
    2
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    We introduce speech emotion recognition by use of continuous hidden Markov models. Two methods are propagated and compared. In the first method, a global statistics framework of an utterance is classified by Gaussian mixture models using derived features of the raw pitch and energy contour of the speech signal. A second method introduces increased temporal complexity, applying continuous hidden Markov models considering several states using low-level instantaneous features instead of global statistics. The paper addresses the design of working recognition engines, and results are achieved with respect to the alluded alternatives. A speech corpus consisting of acted and spontaneous emotion samples in German and English is described in detail. Both engines have been tested and trained using this equivalent speech corpus. Results in recognition of seven discrete emotions exceeded 86% recognition rate. In comparison, the judgment of human deciders classifying the same corpus at 79.8% recognition rate was analyzed.
  • Keywords
    Gaussian processes; emotion recognition; hidden Markov models; natural languages; speech recognition; statistical analysis; Gaussian mixture models; continuous hidden Markov models; global statistics framework; speech emotion recognition; speech signal energy contour; speech signal pitch; temporal complexity; Data mining; Emotion recognition; Engines; Hidden Markov models; Humans; Natural languages; Speech analysis; Speech processing; Statistics; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1202279
  • Filename
    1202279