DocumentCode
3528002
Title
A Bayesian approach to HMM-based speech synthesis
Author
Hashimoto, Kei ; Zen, Heiga ; Nankaku, Yoshihiko ; Masuko, Takashi ; Tokuda, Keiichi
Author_Institution
Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol., Nagoya
fYear
2009
fDate
19-24 April 2009
Firstpage
4029
Lastpage
4032
Abstract
This paper proposes a new framework of speech synthesis based on the Bayesian approach. The Bayesian method is a statistical technique for estimating reliable predictive distributions by marginalizing model parameters. In the proposed framework, all processes for constructing the system can be derived from one single predictive distribution which represents the basic problem of speech synthesis directly. Using HMM as the likelihood function and assuming some approximations, it can be regarded as an application of the variational Bayesian method to the HMM-based speech synthesis. Experimental results show that the proposed method outperforms the conventional one in a subjective test.
Keywords
Bayes methods; approximation theory; hidden Markov models; maximum likelihood estimation; speech synthesis; statistical distributions; variational techniques; HMM-based speech synthesis; approximation theory; marginalizing model parameter; maximum likelihood function; reliable predictive distribution; statistical technique; variational Bayesian method; Bayesian methods; Computer science; Context modeling; Hidden Markov models; Maximum likelihood estimation; Parameter estimation; Predictive models; Speech synthesis; Testing; Training data; HMM-based speech synthesis; context clustering; cross validation; prior distribution; variational Bayesian method;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960512
Filename
4960512
Link To Document