DocumentCode :
284796
Title :
Two-stage F0 control model using syllable based F0 units
Author :
Abe, Masanobu ; Sato, Hirokazu
Author_Institution :
NTT Human Interface Labs., Tokyo, Japan
Volume :
2
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
53
Abstract :
Syllable-based F0 units (SBUs) are proposed for F0 contour generation, along with a two-stage strategy. The two-stage strategy provides a flexible F0 generation framework by introducing a global model and local model. The local model consists of the SBUs which make it possible to estimate the F0 contour precisely using segmental information. Experimental results show that the proposed approach can generate a good global model (the measured multiple correlation coefficient is 0.843) and can precisely estimate average F0 (the measured multiple correlation coefficient is 0.875). It is also confirmed that generating SBUs according to syllable positions is important in precisely estimating the F0 contour. Listening tests show that speech synthesized with the proposed model is preferred to the output of the conventional model. It is expected that the approach will prove to be useful and powerful for synthesizing various types of speech
Keywords :
speech analysis and processing; speech synthesis; fundamental frequency contour generation; global model; local model; multiple correlation coefficient; syllable based F0 units; synthesized speech; two-stage F0 control model; Data mining; Databases; Phase change materials; Speech;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.226122
Filename :
226122
Link To Document :
بازگشت