DocumentCode
3071912
Title
Application of neural networks in emotional speech recognition
Author
Bojanic, Milana ; Crnojevic, Vladimir ; Delic, Vlado
Author_Institution
Fac. of Tech. Sci., Univ. of Novi Sad, Novi Sad, Serbia
fYear
2012
fDate
20-22 Sept. 2012
Firstpage
223
Lastpage
226
Abstract
Emotional speech recognition (ESR) from the aspect of human-machine interaction (HCI) is a prerequisite for the framework of interacting partners within the HCI. This paper addresses the application of neural network (NN) in ESR. The performance of NN is tested using three different feature sets which are basis for ESR: prosodic features, spectral features and a set of their combination. The results of these feature sets are compared using several network topologies and two training algorithms. It has been shown that using joint prosodic-spectral feature set as input to three layer feed-forward NN trained with back-propagation algorithm has the best performance in 5-class emotional speech recognition task.
Keywords
backpropagation; emotion recognition; human computer interaction; neural nets; speech recognition; ESR; HCI; NN; backpropagation algorithm; emotional speech recognition; human-machine interaction; network topologies; neural network application; prosodic features; spectral features; Accuracy; Emotion recognition; Feature extraction; Network topology; Neurons; Speech; Speech recognition; emotional speech recognition; neural network;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Network Applications in Electrical Engineering (NEUREL), 2012 11th Symposium on
Conference_Location
Belgrade
Print_ISBN
978-1-4673-1569-2
Type
conf
DOI
10.1109/NEUREL.2012.6420016
Filename
6420016
Link To Document