• DocumentCode
    3528002
  • Title

    A Bayesian approach to HMM-based speech synthesis

  • Author

    Hashimoto, Kei ; Zen, Heiga ; Nankaku, Yoshihiko ; Masuko, Takashi ; Tokuda, Keiichi

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4029
  • Lastpage
    4032
  • Abstract
    This paper proposes a new framework of speech synthesis based on the Bayesian approach. The Bayesian method is a statistical technique for estimating reliable predictive distributions by marginalizing model parameters. In the proposed framework, all processes for constructing the system can be derived from one single predictive distribution which represents the basic problem of speech synthesis directly. Using HMM as the likelihood function and assuming some approximations, it can be regarded as an application of the variational Bayesian method to the HMM-based speech synthesis. Experimental results show that the proposed method outperforms the conventional one in a subjective test.
  • Keywords
    Bayes methods; approximation theory; hidden Markov models; maximum likelihood estimation; speech synthesis; statistical distributions; variational techniques; HMM-based speech synthesis; approximation theory; marginalizing model parameter; maximum likelihood function; reliable predictive distribution; statistical technique; variational Bayesian method; Bayesian methods; Computer science; Context modeling; Hidden Markov models; Maximum likelihood estimation; Parameter estimation; Predictive models; Speech synthesis; Testing; Training data; HMM-based speech synthesis; context clustering; cross validation; prior distribution; variational Bayesian method;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960512
  • Filename
    4960512