Title :
Emotion detection with hybrid voice quality and prosodic features using Neural Network
Author :
Idris, Inshirah ; Salam, Md Sah Hj
Author_Institution :
Comput. Sci. Dept., Sudan Univ. of Sci. & Technol., Khartoum, Sudan
Abstract :
This paper investigates the detection of speech emotion using different sets of voice quality, prosodic and hybrid features. There are a total of five sets of emotion features experimented in this work which are two from voice quality features, one set from prosodic features and two hybrid features. The experimental data used in the work is from Berlin Emotional Database. Classification of emotion is done using Multi-Layer Perceptron, Neural Network. The results show that hybrid features gave better overall recognition rates compared to voice quality and prosodic features alone. The best overall recognition of hybrid features is 75.51% while for prosodic and voice quality features are 64.67% and 59.63% respectively. Nevertheless, the recognition performance of emotions are varies with the highest recognition rate is for anger with 88% while the lowest is disgust with only 52% using hybrid features.
Keywords :
emotion recognition; multilayer perceptrons; speech recognition; Berlin Emotional Database; emotion classification; hybrid voice quality; multilayer perceptron; neural network; prosodic features; speech emotion detection; Databases; Emotion recognition; Feature extraction; Jitter; Neural networks; Speech; Speech recognition; emotion recognition; multilayer perceptron; prosodic features; voice quality features;
Conference_Titel :
Information and Communication Technologies (WICT), 2014 Fourth World Congress on
Print_ISBN :
978-1-4799-8114-4
DOI :
10.1109/WICT.2014.7076906