DocumentCode
3705093
Title
Multi-stage children story speech synthesis for Hindi
Author
Harikrishna D M; Gurunath Reddy M;K. Sreenivasa Rao
Author_Institution
School of Information Technology, Indian Institute of Technology Kharagpur, 721302, India
fYear
2015
Firstpage
220
Lastpage
224
Abstract
In this paper, we propose a multi-stage children story speech synthesis system for Hindi language. The proposed system performs the following tasks: (i) classification of stories into different genres based on text, (ii) prediction of emotion from story text, (iii) deriving prosody rules (modification factors) specific to emotions and story genres and (iv) synthesis of story speech using mark-up language and prosody modification factors. Keyword and part-of-speech (POS) features are used for story-genre classification and emotion prediction. The prosody modification factors are derived carefully by analyzing the perceptual differences between synthesized neutral speech utterances and their respective utterances narrated by a storyteller. The story is synthesized by the festival based concatenative speech synthesizer with annotated story in the form of SABLE mark-up language. The quality and naturalness of the synthesized story speech is evaluated using subjective tests.
Keywords
"Speech","Synthesizers","Speech synthesis","Semantics","Frequency measurement","Market research"
Publisher
ieee
Conference_Titel
Contemporary Computing (IC3), 2015 Eighth International Conference on
Print_ISBN
978-1-4673-7947-2
Type
conf
DOI
10.1109/IC3.2015.7346682
Filename
7346682
Link To Document