DocumentCode
155679
Title
Parametric speech synthesis using local and global sparse Gaussian processes
Author
Koriyama, Tomoki ; Nose, Takashi ; Kobayashi, Takehiko
Author_Institution
Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Tokyo, Japan
fYear
2014
fDate
21-24 Sept. 2014
Firstpage
1
Lastpage
6
Abstract
This paper describes an application of Gaussian process regression (GPR) to parametric speech synthesis. GPR enables us to predict synthetic speech parameters by utilizing exemplars of training speech data directly without converting the acoustic features of training data into too small number of model parameters thanks to nonparametric Bayesian regression. However, GPR inherently requires high computational cost and resources. In this paper, to alleviate this problem, we incorporate local and global sparse Gaussian process approximation into the statistical speech synthesis framework, and investigate trade-off between computational cost and speech synthesis performance through experiments. Moreover, we examine the way of choosing pseudo data set used for the sparse GP approximation.
Keywords
Bayes methods; Gaussian processes; approximation theory; regression analysis; speech processing; speech synthesis; GPR; Gaussian process regression; computational cost; global sparse Gaussian process approximation; local sparse Gaussian process approximation; nonparametric Bayesian regression; parametric speech synthesis prediction; pseudodata set; sparse GP approximation; statistical speech synthesis framework; Approximation methods; Ground penetrating radar; Hidden Markov models; Kernel; Speech; Speech synthesis; Training; Gaussian process regression; parametric speech synthesis; partially independent conditional (PIC) approximation;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning for Signal Processing (MLSP), 2014 IEEE International Workshop on
Conference_Location
Reims
Type
conf
DOI
10.1109/MLSP.2014.6958921
Filename
6958921
Link To Document