DocumentCode :
1749761
Title :
An RNN-based algorithm to detect prosodic phrase for Chinese TTS
Author :
Ying, Zhiwei ; Shi, Xiaohua
Author_Institution :
Intel China Res. Center, Beijing, China
Volume :
2
fYear :
2001
fDate :
2001
Firstpage :
809
Abstract :
The goal of the work presented here is to automatically predict the prosodic phrase boundaries from the text for Chinese TTS (text-to-speech) by using the trigram of the POS (part-of-speech) with information of the breaks between the prior two word-pairs by using a RNN (recurrent neural network). Prosodic phrase boundaries are very important to a Chinese TTS system because they will influence the prosodic model for speech synthesis. In this paper, the algorithm tries to use RNN to find some mapping relationship between the POS sequence and prosodic phrase boundaries, and hopes to improve the naturalness of synthesized speech
Keywords :
recurrent neural nets; speech synthesis; Chinese TTS; POS; RNN-based algorithm; mapping relationship; naturalness; part-of-speech; prosodic model; prosodic phrase boundaries; prosody; recurrent neural network; synthesized speech; text-to-speech; trigram; word-pairs; Acoustic signal detection; Chaos; Data mining; Natural languages; Partial response channels; Poles and towers; Predictive models; Recurrent neural networks; Speech synthesis; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.941038
Filename :
941038
Link To Document :
بازگشت