• DocumentCode
    2659374
  • Title

    Corpus-based synthesis of Mandarin speech with F0 contours generated by superposing tone components on rule-generated phrase components

  • Author

    Hirose, Keikichi ; Sun, Qinghua ; Minematsu, Nobuaki

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    33
  • Lastpage
    36
  • Abstract
    Mandarin speech was synthesized by generating prosodic features with the proposed method and segmental features with an HMM-based method. The proposed method generates sentence fundamental frequency (F0) contours by representing them as a superposition of tone components on phrase components. The tone components are realized by concatenating their fragments at tone nuclei predicted by a corpus-based method, while the phrase components are generated by rules within the framework of the generation process model (F0 model). As its first step, the method predicts phoneme and pause durations statistically. A listening test on the quality of the synthetic speech showed that the method yields better quality than the full HMM-based method, and also better quality than generating F0 contours without the superpositional scheme.
  • Keywords
    hidden Markov models; speech synthesis; HMM; Mandarin speech synthesis; corpus-based synthesis; prosodic features; rule-generated phrase components; sentence fundamental frequency contours; tone components; Degradation; Frequency; Hidden Markov models; Information science; Natural languages; Predictive models; Speech synthesis; Statistical analysis; System testing; F0 contour; HMM-based speech synthesis; Mandarin speech; phrase component; tone component;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
  • Conference_Location
    Goa
  • Print_ISBN
    978-1-4244-3471-8
  • Electronic_ISBN
    978-1-4244-3472-5
  • Type

    conf

  • DOI
    10.1109/SLT.2008.4777833
  • Filename
    4777833
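
The superposition described in the abstract can be illustrated with a minimal sketch. It assumes the standard form of the generation process model, in which components add in the log-F0 domain and each phrase command produces a response of the form alpha^2 * t * exp(-alpha * t). The function names, the default alpha value, and the representation of the tone component as a precomputed list of log-F0 offsets are all hypothetical choices made for illustration. The record does not give the authors' implementation, and the corpus-based prediction of tone nuclei is not modeled here.

```python
import math


def phrase_response(t, alpha=3.0):
    """Response to a unit phrase command: alpha^2 * t * exp(-alpha * t) for t >= 0.

    The default alpha is an illustrative assumption, not a value from the record.
    """
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)


def phrase_component(times, commands, alpha=3.0):
    """Sum the responses of all phrase commands at each time point.

    commands is a list of (onset_time_s, magnitude) pairs (hypothetical format).
    """
    return [
        sum(mag * phrase_response(t - onset, alpha) for onset, mag in commands)
        for t in times
    ]


def superpose(times, base_f0_hz, commands, tone_component, alpha=3.0):
    """Add tone offsets onto the phrase component in the log-F0 domain.

    tone_component holds one log-F0 offset per time point; here it is simply
    given as input rather than predicted from a corpus.
    Returns the resulting F0 contour in Hz.
    """
    phrase = phrase_component(times, commands, alpha)
    log_base = math.log(base_f0_hz)
    return [math.exp(log_base + p + x) for p, x in zip(phrase, tone_component)]
```

With no phrase commands and an all-zero tone component, `superpose` returns a flat contour at the baseline frequency, which makes the contribution of each component easy to inspect in isolation.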