Title :
HHT based long term feature extracting method for speech emotion classification
Author :
Wang, Zhenlu ; Li, Haifeng ; Ma, Lin
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Abstract :
Speech Emotion Recognition has become more and more important in the development of human-machine interactions. Long term features for the entire sentence are usually obtained by using statistical functions on the low-level descriptors such as pitch or energy at a frame level. In this paper, unlike the former statistical method, we propose a set of new long term features, denoted by LTF-HMS, based on HHT method. Compared to those statistic features based on the FFT, we tested what an important role the LTF-HMS plays in the experiment and the Berlin Emotional Database is applied as our corpus. The experimental results demonstrate that 39.5% of misclassification samples in shorter sentences and 25% misclassification samples in longer sentences caused by using FFT are identified correctly after integrating the LTF-HMS features.
Keywords :
Hilbert transforms; emotion recognition; fast Fourier transforms; feature extraction; human computer interaction; pattern classification; speech recognition; statistical analysis; Berlin Emotional Database; FFT; HHT-based long-term feature extracting method; Hilbert-Huang transform; LTF-HMS; corpus; frame level; human-machine interactions; longer-sentences; low-level descriptors; misclassification samples; shorter-sentences; speech emotion classification; speech emotion recognition; statistical functions; Databases; Emotion recognition; Feature extraction; Mel frequency cepstral coefficient; Speech; Speech recognition; Transforms;
Conference_Titel :
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-0173-2
DOI :
10.1109/ICALIP.2012.6376625