• DocumentCode
    1971973
  • Title

    Automatic emotion detection in speech using mel frequency cesptral coefficients

  • Author

    Bedoya-Jaramillo, S. ; Belalcazar-Bolaños, E. ; Villa-Cañas, T. ; Orozco-Arroyave, J.R. ; Arias-Londono, J.D. ; Vargas-Bonilla, J.F.

  • Author_Institution
    Dept. de Ing. Electron. y Telecomun., Univ. de Antioquia, Antioquia, Colombia
  • fYear
    2012
  • fDate
    12-14 Sept. 2012
  • Firstpage
    62
  • Lastpage
    65
  • Abstract
    Emotional states produce physiological alterations in the vocal tract introducing variability in the acoustic parameters of speech. Emotion recognition in speech can be used in human-machine interaction applications, speaker verification, analysis of neurological disorders and psychological diagnostic tools. This paper proposes the use of Mel Frequency Cesptral Coefficients (MFCC) for automatic detection of emotions in running speech. Experiments were conducted on the Berlin emotional speech database for a three- class problem (anger, boredom and neutral emotional states). In order to evaluate the discrimination ability of the features three different classifiers were implemented: k-nearest neighbor, Bayesian Linear and quadratic. The highest accuracy results are obtained when neutral and anger emotions are evaluated.
  • Keywords
    acoustic signal processing; emotion recognition; man-machine systems; speech recognition; Bayesian linear; Bayesian quadratic; Berlin emotional speech database; Mel frequency cesptral coefficient; acoustic parameter; anger emotional state; automatic emotion detection; boredom emotional state; discrimination ability; emotion recognition; human-machine interaction; k-nearest neighbor; neurological disorder; neutral emotional state; physiological alteration; psychological diagnostic tool; speaker verification; three-class problem; vocal tract; Bayesian methods; Databases; Emotion recognition; Mel frequency cepstral coefficient; Speech; Speech recognition; Vectors; MFCC; feature selection; running speech; speech emotion recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image, Signal Processing, and Artificial Vision (STSIVA), 2012 XVII Symposium of
  • Conference_Location
    Antioquia
  • Print_ISBN
    978-1-4673-2759-6
  • Type

    conf

  • DOI
    10.1109/STSIVA.2012.6340558
  • Filename
    6340558