DocumentCode :
2801577
Title :
Speech Emotion Recognition Using Segmental Level Prosodic Analysis
Author :
Koolagudi, Shashidhar G. ; Kumar, Nitin ; Rao, K. Sreenivasa
Author_Institution :
Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
fYear :
2011
fDate :
24-25 Feb. 2011
Firstpage :
1
Lastpage :
5
Abstract :
In this paper, prosodic analysis of speech segments is performed to recognise emotions. Speech signal is segmented into words and syllables. Energy and pitch parameters are extracted from utterances, words and syllables separately to develop emotion recognition models. Eight emotions (anger, disgust, fear, happy, neutral, sad, sarcastic and surprise) of simulated emotion speech corpus, IITKGP SESC [1] are used in this work for recognition of emotions. Word boundaries are manually marked for 15 utterances of IITKGP-SESC. Syllable boundaries are detected using vowel onset points (VOPs) as anchor locations. Recognition performance of emotions using segmental level prosodic features is not found to be appreciable, but by combining spectral features along with prosodic features, emotion recognition performance is considerably improved. Support vector machines (SVM) and Gaussian mixture models (GMM) are used to develop emotion models to analyse different speech segments for emotion recognition.
Keywords :
emotion recognition; speech recognition; support vector machines; Gaussian mixture model; IITKGP-SESC; SVM; segmental level prosodic analysis; speech emotion recognition; speech segmentation; support vector machine; vowel onset point; Databases; Emotion recognition; Feature extraction; Speech; Speech processing; Speech recognition; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Devices and Communications (ICDeCom), 2011 International Conference on
Conference_Location :
Mesra
Print_ISBN :
978-1-4244-9189-6
Type :
conf
DOI :
10.1109/ICDECOM.2011.5738536
Filename :
5738536
Link To Document :
بازگشت