DocumentCode :
602037
Title :
Robust speech-based happiness recognition
Author :
Chang-Hong Lin ; Siahaan, Ernestasia ; Yu-Hau Chin ; Bo-Wei Chen ; Jia-Ching Wang ; Jhing-Fa Wang
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Central Univ., Jhongli, Taiwan
fYear :
2013
fDate :
12-16 March 2013
Firstpage :
227
Lastpage :
230
Abstract :
This paper presents a robust happiness recognition system. The system consists of a happiness recognition module and a noise suppression module. In the happiness recognition module, we present an emotion feature set comprising Mel-frequency cepstral coefficients (MFCCs), the subband powers, spectral centroid, spectral spread, spectral flatness, RSS, pitch and energy. The proposed feature set is fed into a probability product support vector machine for happiness recognition. In real world applications, the speech received are often exposed to noise, thus prone to reducing the recognition rate. We propose a noise suppression method using subspace based method. A gain function estimation method is used for time domain constrained (TDC) based subspace speech enhancement. The optimal Lagrange multiplier of the gain function will be estimated in accordance with signal to noise ratio (SNR) of the noisy speech. The proposed happiness recognition system has been tested using a large number of noisy speech utterances with a 34% equal error rate.
Keywords :
cepstral analysis; emotion recognition; probability; signal denoising; speech enhancement; support vector machines; time-domain analysis; MFCC; RSS; SNR; TDC; emotion feature set; energy; gain function estimation method; mel-frequency cepstral coefficients; noise suppression module; noisy speech utterances; optimal Lagrange multiplier estimation; pitch; probability product support vector machine; recognition rate reduction; robust speech-based happiness recognition module; signal-to-noise ratio; spectral centroid; spectral flatness; spectral spread; subband powers; subspace based method; time domain constrained based subspace speech enhancement; Kernel; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Speech recognition; Happiness recognition; emotional speech; noise suppression; probability product kernel;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Orange Technologies (ICOT), 2013 International Conference on
Conference_Location :
Tainan
Print_ISBN :
978-1-4673-5934-4
Type :
conf
DOI :
10.1109/ICOT.2013.6521198
Filename :
6521198
Link To Document :
بازگشت