DocumentCode :
2406564
Title :
Emotions in Hindi speech- analysis, perception and recognition
Author :
Agrawal, S.S.
Author_Institution :
Coll. of Eng., KIIT, Gurgaon, India
fYear :
2011
fDate :
26-28 Oct. 2011
Firstpage :
7
Lastpage :
13
Abstract :
Human Speech conveys speaker´s emotional state along with linguistic intelligence. Meaning of a speech sample changes when it is uttered with different emotions. The present paper gives a description of different types of studies conducted to analyze, perceive and recognize commonly occurring emotions in Hindi speech. These have been classified as anger, happiness, fear, sadness, surprise in addition to neutral. Intonation, intensity and duration patterns changes due to changes in sentence types as well as due to changes in emotions. A relationship among the measured acoustic parameters and the patterns has been used to classify them. Experiments have been conducted to study and recognise emotions based on phonetic as well as prosodic parameters in the speech samples due to changes in emotions. These parameters include MFCC & their derivatives and prosodic parameters as the F0, A0 and Duration. In one of the experiment vowel segments taken from continuously spoken sentences and in another experiment Hindi digits were used as speech samples for machine recognition of emotions using the Neural Net classifiers. Human perception experiments have been conducted at all levels of experiments and compared the results with machine recognition performance. In most cases it has been found that machine recognition was found to be better compared to human performance. Both Phonetic as well as prosodic parameters play role in identification of emotions.
Keywords :
acoustic measurement; emotion recognition; linguistics; natural language processing; neural nets; signal classification; speech recognition; Hindi speech; MFCC; acoustic parameter measurement; emotion machine recognition; human speech; linguistic intelligence; neural net classifiers; phonetic parameter; prosodic parameter; speaker emotional state; vowel segments; Biological neural networks; Databases; Emotion recognition; Humans; Spectrogram; Speech; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Speech Database and Assessments (Oriental COCOSDA), 2011 International Conference on
Conference_Location :
Hsinchu
Print_ISBN :
978-1-4577-0930-2
Type :
conf
DOI :
10.1109/ICSDA.2011.6085972
Filename :
6085972
Link To Document :
بازگشت