• DocumentCode
    284796
  • Title

    Two-stage F0 control model using syllable based F0 units

  • Author

    Abe, Masanobu ; Sato, Hirokazu

  • Author_Institution
    NTT Human Interface Labs., Tokyo, Japan
  • Volume
    2
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    53
  • Abstract
    Syllable-based F0 units (SBUs) are proposed for F0 contour generation, along with a two-stage strategy. The two-stage strategy provides a flexible F0 generation framework by introducing a global model and local model. The local model consists of the SBUs which make it possible to estimate the F0 contour precisely using segmental information. Experimental results show that the proposed approach can generate a good global model (the measured multiple correlation coefficient is 0.843) and can precisely estimate average F0 (the measured multiple correlation coefficient is 0.875). It is also confirmed that generating SBUs according to syllable positions is important in precisely estimating the F0 contour. Listening tests show that speech synthesized with the proposed model is preferred to the output of the conventional model. It is expected that the approach will prove to be useful and powerful for synthesizing various types of speech
  • Keywords
    speech analysis and processing; speech synthesis; fundamental frequency contour generation; global model; local model; multiple correlation coefficient; syllable based F0 units; synthesized speech; two-stage F0 control model; Data mining; Databases; Phase change materials; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.226122
  • Filename
    226122