Two-stage F₀ control model using syllable based F₀ units

Author

Abe, Masanobu ; Sato, Hirokazu

Author_Institution

NTT Human Interface Labs., Tokyo, Japan

Volume

2

fYear

1992

fDate

23-26 Mar 1992

Firstpage

53

Abstract

Syllable-based F₀ units (SBUs) are proposed for F₀ contour generation, along with a two-stage strategy. The two-stage strategy provides a flexible F₀ generation framework by introducing a global model and a local model. The local model consists of the SBUs, which make it possible to estimate the F₀ contour precisely using segmental information. Experimental results show that the proposed approach can generate a good global model (the measured multiple correlation coefficient is 0.843) and can precisely estimate average F₀ (the measured multiple correlation coefficient is 0.875). It is also confirmed that generating SBUs according to syllable position is important for precisely estimating the F₀ contour. Listening tests show that speech synthesized with the proposed model is preferred to the output of the conventional model. The approach is expected to prove useful and powerful for synthesizing various types of speech.

Keywords

speech analysis and processing; speech synthesis; fundamental frequency contour generation; global model; local model; multiple correlation coefficient; syllable based F₀ units; synthesized speech; two-stage F₀ control model; Data mining; Databases; Phase change materials; Speech

fLanguage

English

Publisher

IEEE

Conference_Titel

Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on

Conference_Location

San Francisco, CA

ISSN

1520-6149

Print_ISBN

0-7803-0532-9

Type

conf

DOI

10.1109/ICASSP.1992.226122

Filename

226122
