DocumentCode
2351913
Title
Speaker Independent Automatic Emotion Recognition from Speech: A Comparison of MFCCs and Discrete Wavelet Transforms
Author
Firoz Shah, A. ; Vimal Krishnan, V.R. ; Raji Sukumar, A. ; Jayakumar, Athulya ; Babu Anto, P.
Author_Institution
Sch. of Inf. Sci. & Technol., Kannur Univ., Kannur, India
fYear
2009
fDate
27-28 Oct. 2009
Firstpage
528
Lastpage
531
Abstract
Automatic Emotion Recognition (AER) from speech is one of the most interested research domains for the scientific world. AER simply means to make a machine able to recognize the different emotions from speech. We have created and analyzed an elicited database consisting of 700 utterances under four different emotional classes such as neutral happy sad and anger. Malayalam (One of the south Indian languages) was used to conduct the experiment. We have used a database proportion of 80:20 for the training and testing purpose. We have analyzed the emotional speech corpus recorded by using both Discrete Wavelet Transforms (DWTs) and Mel Frequency Cepstral Coefficients (MFCCs) and obtained an overall recognition accuracy of 68.5% and 55% respectively. artificial neural network was used for the classification and recognition purpose.
Keywords
audio databases; discrete wavelet transforms; emotion recognition; neural nets; Indian languages; MFCC; Mel frequency cepstral coefficients; artificial neural network; discrete wavelet transforms; emotional speech corpus; speaker independent automatic emotion recognition; Automatic speech recognition; Cepstral analysis; Data analysis; Databases; Discrete wavelet transforms; Emotion recognition; Speech analysis; Speech recognition; Testing; Wavelet analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Recent Technologies in Communication and Computing, 2009. ARTCom '09. International Conference on
Conference_Location
Kottayam, Kerala
Print_ISBN
978-1-4244-5104-3
Electronic_ISBN
978-0-7695-3845-7
Type
conf
DOI
10.1109/ARTCom.2009.231
Filename
5329196
Link To Document