• DocumentCode
    2132493
  • Title

    Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model

  • Author

    Hirose, Keikichi ; Matsuda, Tatsuya ; Hashimoto, Hiroya ; Minematsu, Nobuaki

  • Author_Institution
    Dept. of Inf. & Commun. Eng., Univ. of Tokyo, Tokyo, Japan
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Frame-by-frame representation is not appropriate for prosodic features, which are tightly related to speech units spreading a wide time span, such as words, phrases and so on. This causes an inherit problem in fundamental frequency (F0) contour generation by HMM-based speech synthesis. A method is developed to modify F0 contours in the framework of a generation process model by referring to linguistic information of input text (word boundary and accent type). It takes F0 variances obtained through HMM-based speech synthesis into account during the process. Through a listening experiment on synthetic speech, the method is proved to generate better quality as compared to the HMM-based speech synthesis on average. Since the generation process model can clearly relate its commands and linguistic (and para-/non- linguistic) information, the method has an additional advantage; changing speech styles, and /or adding further information (such as emphasis) can be easily done through manipulating the commands.
  • Keywords
    hidden Markov models; speech synthesis; HMM; accent type; command manipulation; fundamental frequency contour generation; fundamental frequency contour representation; generation process model; speech synthesis; word boundary; Frequency synthesizers; Hidden Markov models; Mathematical model; Pragmatics; Speech; Speech synthesis; HMM-based speech synthesis; flexible control; fundamental frequency contour; generation process model; linguistic information;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing (MLSP), 2011 IEEE International Workshop on
  • Conference_Location
    Santander
  • ISSN
    1551-2541
  • Print_ISBN
    978-1-4577-1621-8
  • Electronic_ISBN
    1551-2541
  • Type

    conf

  • DOI
    10.1109/MLSP.2011.6064596
  • Filename
    6064596