DocumentCode :
1987167
Title :
Telugu emotional story speech synthesis using SABLE markup language
Author :
Reddy, M. Gurunath ; Harikrishna, D.M. ; Rao, K. Sreenivasa ; Manjunath, K.E.
Author_Institution :
Sch. of Inf. Technol., Indian Inst. of Technol., Kharagpur, Kharagpur, India
fYear :
2015
fDate :
2-3 Jan. 2015
Firstpage :
331
Lastpage :
335
Abstract :
In this paper, a framework for synthesizing Telugu emotional speech for story telling applications is presented. An XML based markup langauge, SABLE is used to synthesize the emotions from a given story text. SABLE markup defines a set of tags to improve the quality of the synthesized speech from the concatinative speech synthesizer. In this work, a subset of prosody tags are used to synthesize the emotional speech from a given story text. Modified Zero frequency filtered (ZFF) signal is used to derive the prosody correlates of pitch base, pitch range and intensity. The desired prosody modification factors for each emotion is derived at phrase level. The derived prosody modification parameters for each emotion are stored in the form of a template. During synthesis, hand annotated story text is replaced by prosody tags which are stored in templates. Prosody tagged story text at phrase level is automatically converted into SABLE markup format. The markup story text is used to synthesize emotional speech from the Telugu neutral Festival TTS system. The quality and naturalness of the synthesised emotional story speech is evaluated using subjective tests.
Keywords :
XML; emotion recognition; filtering theory; speech synthesis; SABLE markup format; SABLE markup language; Telugu emotional speech; Telugu emotional story speech synthesis; Telugu neutral festival TTS system; XML based markup langauge; ZFF signal; concatinative speech synthesizer; hand annotated story text; markup story text; phrase level; pitch intensity; pitch range; prosody modification factor; prosody modification parameter; prosody tag; story telling application; synthesized speech; zero frequency filtered signal; Databases; Educational institutions; Pragmatics; Speech; Speech synthesis; Synthesizers; Emotional speech; Emotions; Prosody tags; SABLE markup; Synthetic speech; ZFF;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing And Communication Engineering Systems (SPACES), 2015 International Conference on
Conference_Location :
Guntur
Type :
conf
DOI :
10.1109/SPACES.2015.7058278
Filename :
7058278
Link To Document :
بازگشت