DocumentCode
1607260
Title
Better human computer interaction by enhancing the quality of text-to-speech synthesis
Author
Reddy, V.R. ; Rao, K. Sreenivasa
Author_Institution
Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
fYear
2012
Firstpage
1
Lastpage
6
Abstract
In this paper we propose high quality prosody models for enhancing the quality of text-to-speech (TTS) synthesis for providing better human computer interaction. In this study prosody refers to duration and intonation patterns of the sequence of syllables. In this work, prosody models are developed using feedforward neural networks, and prosodic information is predicted from linguistic and production constraints of syllables. The prediction accuracy of the proposed neural network based prosody models is compared objectively with Classification and Regression Tree based prosody models used by Festival. Subjective listening tests are also performed to evaluate the quality of the synthesized speech generated by incorporating the predicted prosodic features. From the evaluation studies, it is observed that prediction accuracy is better for neural network models, compared to other models.
Keywords
feedforward neural nets; human computer interaction; speech synthesis; text analysis; TTS synthesis quality enhancement; feedforward neural networks; human computer interaction; linguistic constraints; production constraints; prosodic information prediction; prosody models; subjective listening tests; syllable sequence duration; syllable sequence intonation patterns; text-to-speech synthesis quality enhancement; Accuracy; Computational modeling; Neural networks; Pragmatics; Predictive models; Production; Speech; Human computer interaction; festival; neural networks; prosody; text-to-speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Human Computer Interaction (IHCI), 2012 4th International Conference on
Conference_Location
Kharagpur
Print_ISBN
978-1-4673-4367-1
Type
conf
DOI
10.1109/IHCI.2012.6481857
Filename
6481857
Link To Document