Title :
A first study on neural net based generation of prosodic and spectral information for Mandarin text-to-speech
Author :
Sin-Horng Chen ; Hwang, Shaw-IIwa ; Tsai, Chun-Yu
Author_Institution :
Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Abstract :
A neural-network-based approach to generating prosodic and spectral information of syllables for Mandarin text-to-speech synthesis is studied. Some contextual features are first extracted from a given input text by text analysis and taken as input signals for synthesis. Then, six multilayer perceptrons are employed to generate pause duration, syllable duration, and pitch mean and shape of one- and two-syllable synthesis units, several reproduction templates of proper size are first generated for each synthesis unit of syllable approach. The objective is to generate spectral patterns of the syllable that can be directly concatenated to synthesize natural speech without further modification. The validity of this novel approach was examined by simulation using a database of sentential utterances recorded from TV news, reported by a single female announcer. Experimental results confirmed that this is a promising approach for Mandarin text-to-speech synthesis
Keywords :
feedforward neural nets; natural languages; speech synthesis; Mandarin text-to-speech; TV news; contextual features; female speech; multilayer perceptrons; natural speech; neural net based generation; one-syllable synthesis units; pause duration; pitch mean; prosodic information; sentential utterances; simulation; spectral information; spectral patterns; syllable duration; two-syllable synthesis units; Concatenated codes; Data mining; Feature extraction; Multilayer perceptrons; Natural languages; Neural networks; Shape; Signal synthesis; Speech synthesis; Text analysis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.226124