Title :
On the process of coarticulation for a CELP-based Chinese text-to-speech system using LSP frequencies
Author :
Chen, Jau-Hung ; Wu, Chung-Hsien
Author_Institution :
Inst. of Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Abstract :
This study proposes a novel approach based on a Bayesian network and the LSP frequencies to generate syllable prosody and the coarticulation between two concatenated syllables respectively. The Bayesian network is employed to model the relation between the prosodic information and the linguistic features. Given a Chinese character sequence, the Bayesian network can provide appropriate prosodic information, including pitch contour, syllable intensity, syllable duration and pause duration. Furthermore, the coarticulation is generated by adjusting the LSP frequencies in a CELP-based synthesizer. The synthesized speech is tested on twenty subjects. The test results indicate that the average correct rate is 95.8% for intelligibility, and the mean opinion score (MOS) is 3.2 for naturalness
Keywords :
Bayes methods; linear predictive coding; natural languages; speech coding; speech intelligibility; speech processing; speech synthesis; Bayesian network; CELP based synthesizer; Chinese character sequence; Chinese text to speech system; LSP frequencies; average correct rate; coarticulation; concatenated syllables; linguistic features; mean opinion score; pause duration; pitch contour; prosodic information; speech intelligibility; speech naturalness; syllable duration; syllable intensity; syllable prosody; synthesized speech; test results; Bayesian methods; Concatenated codes; Data mining; Frequency synthesizers; Natural languages; Network synthesis; Speech coding; Speech synthesis; Testing; Text analysis;
Conference_Titel :
TENCON '96. Proceedings., 1996 IEEE TENCON. Digital Signal Processing Applications
Conference_Location :
Perth, WA
Print_ISBN :
0-7803-3679-8
DOI :
10.1109/TENCON.1996.608697