DocumentCode
284796
Title
Two-stage F0 control model using syllable based F0 units
Author
Abe, Masanobu ; Sato, Hirokazu
Author_Institution
NTT Human Interface Labs., Tokyo, Japan
Volume
2
fYear
1992
fDate
23-26 Mar 1992
Firstpage
53
Abstract
Syllable-based F0 units (SBUs) are proposed for F0 contour generation, along with a two-stage strategy. The two-stage strategy provides a flexible F0 generation framework by introducing a global model and a local model. The local model consists of the SBUs, which make it possible to estimate the F0 contour precisely using segmental information. Experimental results show that the proposed approach can generate a good global model (the measured multiple correlation coefficient is 0.843) and can precisely estimate average F0 (the measured multiple correlation coefficient is 0.875). It is also confirmed that generating SBUs according to syllable positions is important in precisely estimating the F0 contour. Listening tests show that speech synthesized with the proposed model is preferred to the output of the conventional model. It is expected that the approach will prove to be useful and powerful for synthesizing various types of speech.
Keywords
speech analysis and processing; speech synthesis; fundamental frequency contour generation; global model; local model; multiple correlation coefficient; syllable based F0 units; synthesized speech; two-stage F0 control model; Data mining; Databases; Phase change materials; Speech;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location
San Francisco, CA
ISSN
1520-6149
Print_ISBN
0-7803-0532-9
Type
conf
DOI
10.1109/ICASSP.1992.226122
Filename
226122