Title :
Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization
Author :
Koriyama, Tomoki ; Nose, Takashi ; Kobayashi, Takehiko
Author_Institution :
Interdiscipl. Grad. Sch. of Sci. & Eng., Tokyo Inst. of Technol., Yokohama, Japan
Abstract :
This paper examines two issues of a statistical speech synthesis approach based Gaussian process (GP) regression. Although GP-based speech synthesis can give higher performance in generating spectral parameters than the HMM-based one, a number of issues still remain. In this paper, we incorporate global variance (GV) feature to overcome over-smoothing problem into the parameter generation. Furthermore, in order to utilize an appropriate kernel function in accordance with actual data, we propose an EM-based kernel hyperparameter optimization technique. Objective and subjective evaluation results show that using GV and hyperparameter estimation enhanced the performance in spectral feature generation.
Keywords :
Gaussian processes; parameter estimation; regression analysis; speech synthesis; EM-based kernel hyperparameter optimization technique; GP-based speech synthesis; GV feature; Gaussian process regression; global variance feature; kernel function; objective evaluation; over-smoothing problem; parameter generation; parametric speech synthesis; spectral feature generation; spectral parameters; statistical speech synthesis approach; subjective evaluation; Acoustics; Hidden Markov models; Kernel; Optimization; Speech; Speech synthesis; Trajectory; Gaussian process; global variance; kernel hyperparameter; statistical parametric speech synthesis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854319