مرکز منطقه ای اطلاع رساني علوم و فناوري - Neural network-based F0 text-to-speech synthesiser for Mandarin

DocumentCode :

1220923

Title :

Neural network-based F0 text-to-speech synthesiser for Mandarin

Author :

Hwang, S.-H. ; Chen, S.-H.

Author_Institution :

Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan

Volume :

141

Issue :

fYear :

1994

fDate :

12/1/1994 12:00:00 AM

Firstpage :

384

Lastpage :

390

Abstract :

A neural-network-based approach to synthesising F0 information for Mandarin text-to-speech is discussed. The basic idea is to use neural networks to model the relationship between linguistic features. Extracted from input text and parameters representing the pitch contour of syllables. Two MLPs are used to separately synthesise the mean and shape of pitch contour, using different linguistic features. A large set of utterances is employed to train these MLPs using the well known back-propagation algorithm. Pronunciation rules for generating F0 information are automatically learned and implicitly memorised by the MLPs. In the synthesis, parameters representing the mean and shape of the pitch contour of each syllable are generated using linguistic features extracted from the given input text. Simulation results confirmed that this is a promising approach for F0 synthesis. The resulting synthesised pitch contours of syllables match well with their original counterparts. Average root mean square errors of 0.94 ms/frame and 1.00 ms/frame were achieved

Keywords :

backpropagation; multilayer perceptrons; natural languages; recurrent neural nets; speech synthesis; F0 synthesis; F0 text-to-speech synthesiser; Mandarin; average root mean square errors; back-propagation algorithm; linguistic features; mean; multilayer perceptron; neural networks; pitch contour; pronunciation rules; shape; simulation results; syllables;

fLanguage :

English

Journal_Title :

Vision, Image and Signal Processing, IEE Proceedings -

Publisher :

iet

ISSN :

1350-245X

Type :

jour

DOI :

10.1049/ip-vis:19941421

Filename :

342275

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1220923