Title :
On the modeling of natural vocal emotion expressions through binary key
Author :
Luque, Jordi ; Anguera, Xavier
Author_Institution :
Telefonica Res., Barcelona, Spain
Abstract :
This work presents a novel method to estimate natural expressed emotions in speech through binary acoustic modeling. Standard acoustic features are mapped to a binary value representation and a support vector regression model is used to correlate them with the three-continuous emotional dimensions. Three different sets of speech features, two based on spectral parameters and one on prosody are compared on the VAM corpus, a set of spontaneous dialogues from a German TV talk-show. The regression analysis, in terms of correlation coefficient and mean absolute error, show that the binary key modeling is able to successfully capture speaker emotion characteristics. The proposed algorithm obtains comparable results to those reported on the literature while it relies on a much smaller set of acoustic descriptors. Furthermore, we also report on preliminary results based on the combination of the binary models, which brings further performance improvements.
Keywords :
acoustic signal processing; emotion recognition; regression analysis; speech recognition; support vector machines; German TV talk-show; VAM corpus; acoustic descriptors; binary acoustic modeling; binary key modeling; binary value representation; correlation coefficient; mean absolute error; natural vocal emotion expression modelling; speaker emotion characteristics; spectral parameters; speech features; spontaneous dialogues; standard acoustic feature mapping; support vector regression model; three-continuous emotional dimensions; Acoustics; Emotion recognition; Feature extraction; Speech; Speech recognition; Training; Vectors; Emotion modeling; VAM corpus; binary fingerprint; dimensional emotions;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
Conference_Location :
Lisbon