DocumentCode :
2508546
Title :
Real life emotion classification using VOP and pitch based spectral features
Author :
Koolagudi, Shashidhar G. ; Rao, K. Sreenivasa
Author_Institution :
Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
fYear :
2010
fDate :
17-19 Dec. 2010
Firstpage :
1
Lastpage :
4
Abstract :
In this work, vowel onset points (VOPs) and pitch based spectral features are used for speech emotion classification. VOP is an anchor point from which vowel begins in a CV unit (generally a syllable). These are estimated using energy values of linear prediction (LP) residual, short time spectrum and modulation spectrum. Identification of vowel, consonant and CV transition regions of a syllable is done based on VOP locations. Spectral features are separately extracted from vowel, consonant and CV transition regions. Emotion recognition models (ERMs) are developed separately using these spectral features. Gaussian mixture models (GMMs) are used as classifiers. Pitch synchronous analysis is done on speech signal and spectral features are extracted from each pitch period to develop emotion models. Simulated emotion speech corpus, IITKGP-SESC is used as a database. Results of IITKGP-SESC are compared with those of Berlin emotion speech database Emo-DB. Emotion recognition performance observed is about 92% and 89% for IITKGP-SESC and Emo-DB respectively. Later the approach has been extended to the recognition of real life emotions, using IITKGP-movie emotion speech corpus (IITKGP-MESC). About 82% of emotion recognition performance is observed in this case.
Keywords :
Gaussian processes; emotion recognition; speech recognition; Gaussian mixture models; IITKGP-SESC; VOP; emotion recognition models; emotion speech corpus; linear prediction; pitch based spectral features; pitch synchronous analysis; real life emotion classification; speech emotion classification; vowel onset points; Databases; Emotion recognition; Feature extraction; Motion pictures; Speech; Speech processing; Speech recognition; CV-transition region; Consonant region; Emo-DB; Emotion classification; IITKGP-MESC; IITKGP-SESC; Spectral features; Vowel onset point; Vowel region;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
India Conference (INDICON), 2010 Annual IEEE
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-9072-1
Type :
conf
DOI :
10.1109/INDCON.2010.5712728
Filename :
5712728
Link To Document :
بازگشت