DocumentCode
2853081
Title
Synthesis of unseen context and spectral and pitch contour smoothing in concatenated text to speech synthesis
Author
Low, Phuay Hui ; Vaseghi, Saeed
Author_Institution
Department of Electronic and Computer Engineering, Brunel University, London, UB8 3PH, UK
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
The availability and perceptual clarity of speech units, and how these units are put together during synthesis, have always been the cornerstones of any high-quality concatenative text-to-speech synthesis (TTS) system. The speech units are usually obtained from different sentences and contexts in a speaker-dependent speech database. One of the problems with speech units obtained this way is the occurrence of unseen contexts. Here, unseen contexts denote phonological sequences that are not acoustically represented in the selection pool during synthesis. Unseen units are expected in any concatenative TTS system because it is difficult to obtain an acoustic representation of all possible contexts that could occur in speech. This paper proposes a pitch-synchronous overlap-and-merge method to synthesise the acoustic representation of unseen contexts from existing similar units found in the inventory. It also gives a brief description of spectral and pitch contour smoothing across concatenated units.
Keywords
Artificial neural networks; Speech; Testing; Transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743756
Filename
5743756