Title :
An RNN-based algorithm to detect prosodic phrase for Chinese TTS
Author :
Ying, Zhiwei ; Shi, Xiaohua
Author_Institution :
Intel China Res. Center, Beijing, China
Abstract :
The goal of the work presented here is to automatically predict the prosodic phrase boundaries from the text for Chinese TTS (text-to-speech) by using the trigram of the POS (part-of-speech) with information of the breaks between the prior two word-pairs by using a RNN (recurrent neural network). Prosodic phrase boundaries are very important to a Chinese TTS system because they will influence the prosodic model for speech synthesis. In this paper, the algorithm tries to use RNN to find some mapping relationship between the POS sequence and prosodic phrase boundaries, and hopes to improve the naturalness of synthesized speech
Keywords :
recurrent neural nets; speech synthesis; Chinese TTS; POS; RNN-based algorithm; mapping relationship; naturalness; part-of-speech; prosodic model; prosodic phrase boundaries; prosody; recurrent neural network; synthesized speech; text-to-speech; trigram; word-pairs; Acoustic signal detection; Chaos; Data mining; Natural languages; Partial response channels; Poles and towers; Predictive models; Recurrent neural networks; Speech synthesis; Text analysis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.941038