Title :
Emulating human cognitive approach for speech emotion using MLP and GenSofNN
Author :
Kamaruddin, Norhaslinda ; Wahab, Abdul
Author_Institution :
Fac. of Comput. & Math. Sci., MARA Univ. of Technol., Shah Alam, Malaysia
Abstract :
Speech emotion recognition field is growing due to the increasing needs for effective human-computer interaction. There are many approaches in term of features extraction methods coupled with classifiers to obtain optimum performance. However, none can claim superiority as it is very data-dependant and domain-oriented. In this paper, the appropriate sets of features are investigated using segregation method and feature ranking algorithm of Automatic Relevance Determination (ARD) [1]. Two popular classifiers of Multi Layer Perceptron (MLP) [2] and Generic Self-organizing Fuzzy Neural Network (GenSoFNN) [3] are employed to discriminate emotions in the data corpus used in the FAU Aibo Emotion Corpus [4, 5]. The experimental results shows that Mel Frequency Cepstral Coefficient (MFCC) [6] features are able to yield comparable accuracy with baseline result [5]. In addition, it is observed that MLP can perform slightly better than GenSoFNN. Hence, such system envisages that appropriate combination of features extracted with good classifiers is fundamental for the good speech emotion recognition system.
Keywords :
emotion recognition; feature extraction; human computer interaction; multilayer perceptrons; self-organising feature maps; speech recognition; FAU Aibo emotion corpus; GenSofNN; MLP; Mel frequency cepstral coefficient; automatic relevance determination; data corpus; feature extraction method; feature ranking algorithm; generic self-organizing fuzzy neural network; human cognitive approach; human computer interaction; multilayer perceptron; segregation method; speech emotion recognition system; Accuracy; Emotion recognition; Fuzzy neural networks; Mel frequency cepstral coefficient; Neurons; Speech; Speech recognition; Automatic Relevance Determination; Generic Self-organizing Fuzzy Neural Network; Mel Frequency Cepstral Coefficient; Multi Layer Perceptron; speech emotion recognition;
Conference_Titel :
Information and Communication Technology for the Muslim World (ICT4M), 2013 5th International Conference on
Conference_Location :
Rabat
Print_ISBN :
978-1-4799-0134-0
DOI :
10.1109/ICT4M.2013.6518885