DocumentCode :
312281
Title :
Automatic generation of prosodic structure for high quality Mandarin speech synthesis
Author :
Chou, Fu-Chiang ; Tseng, Chiu-Yu ; Lee, Lin-shan
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume :
3
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
1624
Abstract :
A key problem for today´s speech synthesis technology is to automatically generate an appropriate hierarchical prosodic structure for text input and incorporate it into synthesized speech. The paper presents a method for such a problem in Mandarin Chinese. This method uses a speech database for the training of a statistical model to generate the prosodic structure and determine prosodic parameters such as syllable duration, pause, energy and intonation. The experimental results show that an accuracy of 83.1% in the prediction of prosodic structure can be achieved. Furthermore, a Chinese text-to-speech system can be developed based on the proposed prosodic structure
Keywords :
natural languages; speech synthesis; statistical analysis; Chinese text-to-speech system; automatic hierarchical prosodic structure generation; energy; high quality Mandarin speech synthesis; intonation; pause; speech database; statistical model training; syllable duration; synthesized speech; text input; Appropriate technology; History; Information science; Labeling; Neural networks; Predictive models; Spatial databases; Speech synthesis; Tagging; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607935
Filename :
607935
Link To Document :
بازگشت