DocumentCode :
284793
Title :
Concatenative speech synthesis by minimum distortion criteria
Author :
Iwahashi, Naoto ; Kaiki, Nobuyoshi ; Sagisaka, Yoshinori
Author_Institution :
ATR Interpreting Telephony Res. Lab., Kyoto, Japan
Volume :
2
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
65
Abstract :
A scheme is proposed for concatenative speech synthesis to improve the segment selection procedure by minimizing acoustic distortion between the selected segment and the desired spectrum for the target. The spectral prototypicality of a segment, the spectral difference between the source and target contexts, the degradation resulting from concatenation of phonemes, and the acoustic continuity between the concatenated segments are all considered as measures. A search method for selecting segments from a large speech database is also described. In this method, a three-step optimization is used for distortion minimization. A perceptual test shows that contextual spectral difference and acoustic continuity at the segment boundary are important measures for improving the quality of synthesized speech
Keywords :
speech synthesis; acoustic continuity; acoustic distortion; concatenative speech synthesis; distortion minimization; minimum distortion criteria; optimization; phonemes; search method; segment boundary; segment selection; spectral difference; speech database; speech quality; Acoustic distortion; Acoustic measurements; Concatenated codes; Databases; Degradation; Distortion measurement; Optimization methods; Prototypes; Search methods; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.226119
Filename :
226119
Link To Document :
بازگشت