DocumentCode :
2838720
Title :
A Mandarin TTS system with an integrated prosodic model
Author :
Pin, ShaoHuang ; Lee, Yehlin ; Chen, Yong-Cheng ; Wang, Hsin-Min ; Tseng, Chiu-Yu
Author_Institution :
Phonetics Lab, Acad. Sinica, Taipei, Taiwan
fYear :
2004
fDate :
15-18 Dec. 2004
Firstpage :
169
Lastpage :
172
Abstract :
Phrase grouping is essential to characterize the prosody of Mandarin fluent speech. Evidence of prosodic phrase grouping has been found both in adjustments of F0 contours and temporal allocations within and across phrases. We discuss the development of a Mandarin TTS system that integrates prosody processing modules, such as duration modeling, F0 modeling, and break predictions. The database consists of 1292×3 syllable-tokens chopped off specially designed three-phrase carrier sentences. Since temporal allocations and rhythmic structure in speech flow are carefully dealt with, the TTS system is capable of converting long paragraph text input into natural synthesized speech output.
Keywords :
natural language interfaces; speech synthesis; text analysis; Mandarin TTS system; break predictions; duration modeling; fluent speech; integrated prosodic model; long paragraph text; natural synthesized speech; prosodic phrase grouping; rhythmic structure; temporal allocations; three-phrase carrier sentences; Bismuth; Databases; Humans; Information science; Labeling; Modems; Natural languages; Predictive models; Speech analysis; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN :
0-7803-8678-7
Type :
conf
DOI :
10.1109/CHINSL.2004.1409613
Filename :
1409613
Link To Document :
بازگشت