Title :
Generation of emotional speech by prosody imposition on sentence, word and syllable level fragments of neutral speech
Author :
Yadav, Jainath ; Rao, K. Sreenivasa
Author_Institution :
Sch. of Inf. Technol., Indian Inst. of Technol., Kharagpur, Kharagpur, India
Abstract :
In emotional-speech, it is observed that some words and phrases are spoken prominently, compared to neutral-speech. The prominence of these specific words and phrases are reflected in the form of prosodic features such as duration, intonation and intensity patterns of the words or phrases. The neutral speech and emotional speech have basic difference due to prosody aspects of speech. Three acoustic aspects of prosodic features were examined: the pitch contour, durations, and the intensity contour. These prosodic features from Hindi emotional-speech are imposed on Hindi neutral-speech at three different levels; sentence, word and syllable levels. The pitch contour, durations, and the intensity contour were imposed on neutral-speech using Praat tool with the help of Praat script. Subjective result indicates that syllable level fragments are good choice than word or sentence level fragments for generating emotional speech from neutral speech.
Keywords :
emotion recognition; feature extraction; natural language processing; speech synthesis; Hindi emotional-speech generation; Hindi neutral-speech; Praat script; Praat tool; acoustic aspects; intensity contour; pitch contour; prosodic features; prosody imposition; sentence level fragments; syllable level fragments; word level fragments; Conferences; Databases; Hidden Markov models; Speech; Speech synthesis; Neutral text-to-speech system; PSOLA Algorithm; Praat; Praat Script; duration pattern; intonation pattern; prosody;
Conference_Titel :
Cognitive Computing and Information Processing (CCIP), 2015 International Conference on
Conference_Location :
Noida
DOI :
10.1109/CCIP.2015.7100694