Speaker Independent Automatic Emotion Recognition from Speech: A Comparison of MFCCs and Discrete Wavelet Transforms

Author

Firoz Shah, A. ; Vimal Krishnan, V.R. ; Raji Sukumar, A. ; Jayakumar, Athulya ; Babu Anto, P.

Author_Institution

Sch. of Inf. Sci. & Technol., Kannur Univ., Kannur, India

fYear

2009

fDate

27-28 Oct. 2009

Firstpage

528

Lastpage

531

Abstract

Automatic Emotion Recognition (AER) from speech is one of the most interested research domains for the scientific world. AER simply means to make a machine able to recognize the different emotions from speech. We have created and analyzed an elicited database consisting of 700 utterances under four different emotional classes such as neutral happy sad and anger. Malayalam (One of the south Indian languages) was used to conduct the experiment. We have used a database proportion of 80:20 for the training and testing purpose. We have analyzed the emotional speech corpus recorded by using both Discrete Wavelet Transforms (DWTs) and Mel Frequency Cepstral Coefficients (MFCCs) and obtained an overall recognition accuracy of 68.5% and 55% respectively. artificial neural network was used for the classification and recognition purpose.

Keywords

audio databases; discrete wavelet transforms; emotion recognition; neural nets; Indian languages; MFCC; Mel frequency cepstral coefficients; artificial neural network; discrete wavelet transforms; emotional speech corpus; speaker independent automatic emotion recognition; Automatic speech recognition; Cepstral analysis; Data analysis; Databases; Discrete wavelet transforms; Emotion recognition; Speech analysis; Speech recognition; Testing; Wavelet analysis;

fLanguage

English

Publisher

ieee

Conference_Titel

Advances in Recent Technologies in Communication and Computing, 2009. ARTCom '09. International Conference on

Conference_Location

Kottayam, Kerala

Print_ISBN

978-1-4244-5104-3

Electronic_ISBN

978-0-7695-3845-7

Type

conf

DOI

10.1109/ARTCom.2009.231

Filename

5329196