Title :
Automatic Speech Emotion Recognition: A survey
Author :
Chandrasekar, P. ; Chapaneri, Santosh ; Jayaswal, Deepak
Author_Institution :
Dept. of Electron. & Telecommun. Eng., Univ. of Mumbai, Mumbai, India
Abstract :
The area of Automatic Speech Emotion Recognition (ASER) has garnered a lot of interest among researchers. The framework of ASER typically includes three steps viz. speech feature extraction, dimensionality reduction and feature classification. At the base of this framework lies the design and recording of the database of emotional states through which the most popular set of emotions-happiness, sadness, anger, fear, disgust, boredom (which are typically called as `archetypal emotions´) and neutral among others have been obtained. This paper surveys the extent of work done in this field especially highlighting the three steps of the ASER framework. Starting with the different languages that have been explored till date for creating the databases, this paper attempts to categorize the features that have been typically extracted, enlist the dimensionality reduction techniques that have been chosen and discuss the pros and cons, if any, of the feature classifiers that have been modelled.
Keywords :
data reduction; emotion recognition; feature extraction; speech recognition; ASER framework; automatic speech emotion recognition; dimensionality reduction techniques; emotional state database; feature classification; feature classifiers; speech feature extraction; Databases; Emotion recognition; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; dimensionality reduction; feature classification; feature extraction;
Conference_Titel :
Circuits, Systems, Communication and Information Technology Applications (CSCITA), 2014 International Conference on
Conference_Location :
Mumbai
DOI :
10.1109/CSCITA.2014.6839284