DocumentCode
323793
Title
Speech synthesis using warped linear prediction and neural networks
Author
Karjalainen, Matti ; Altosaar, Toomas ; Vainio, Martti
Author_Institution
Lab. of Acoust. & Audio Signal Process., Helsinki Univ. of Technol., Espoo, Finland
Volume
2
fYear
1998
fDate
12-15 May 1998
Firstpage
877
Abstract
A text-to-speech synthesis technique, based on warped linear prediction (WLP) and neural networks, is presented for high-quality individual sounding synthetic speech. Warped linear prediction is used as a speech production model with wide audio bandwidth yet with highly compressed control parameter data. An excitation codebook, inverse filtered from a target speaker´s voice, is applied to obtain individual tone quality. A set of neural networks, specialized to yield synthesis control parameters from phonemic input in specific contexts, generate the detailed parametric controls of WLP. Neural nets are also used successfully to compute the prosodic parameters. We have applied this approach in prototyping highly improved text-to-speech synthesis for the Finnish language
Keywords
data compression; filtering theory; inverse problems; learning (artificial intelligence); linear predictive coding; neural nets; speech coding; speech intelligibility; speech synthesis; Finnish language; compressed control parameter data; excitation codebook; inverse filtered codebook; network training; neural networks; phonemic input; prosodic parameters; speech coding; synthesis control parameters; synthetic speech; text-to-speech synthesis; tone quality; warped linear prediction; Bandwidth; Control system synthesis; Humans; Network synthesis; Neural networks; Nonlinear filters; Predictive models; Sampling methods; Speech processing; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675405
Filename
675405
Link To Document