Title :
Concatenative speech synthesis by minimum distortion criteria
Author :
Iwahashi, Naoto ; Kaiki, Nobuyoshi ; Sagisaka, Yoshinori
Author_Institution :
ATR Interpreting Telephony Res. Lab., Kyoto, Japan
Abstract :
A scheme is proposed for concatenative speech synthesis to improve the segment selection procedure by minimizing acoustic distortion between the selected segment and the desired spectrum for the target. The spectral prototypicality of a segment, the spectral difference between the source and target contexts, the degradation resulting from concatenation of phonemes, and the acoustic continuity between the concatenated segments are all considered as measures. A search method for selecting segments from a large speech database is also described. In this method, a three-step optimization is used for distortion minimization. A perceptual test shows that contextual spectral difference and acoustic continuity at the segment boundary are important measures for improving the quality of synthesized speech
Keywords :
speech synthesis; acoustic continuity; acoustic distortion; concatenative speech synthesis; distortion minimization; minimum distortion criteria; optimization; phonemes; search method; segment boundary; segment selection; spectral difference; speech database; speech quality; Acoustic distortion; Acoustic measurements; Concatenated codes; Databases; Degradation; Distortion measurement; Optimization methods; Prototypes; Search methods; Speech synthesis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.226119