Title :
Minimum unit selection error training for HMM-based unit selection speech synthesis system
Author :
Ling, Zhen-Hua ; Wang, Ren-Hua
Author_Institution :
iFlytek Speech Lab., Univ. of Sci. & Technol. of China, Hefei
fDate :
March 31 2008-April 4 2008
Abstract :
This paper presents a minimum unit selection error (MUSE) training method for HMM-based unit selection speech synthesis system, which selects the optimal phone-sized unit sequence from the speech database by maximizing the combined likelihood of a group of trained HMMs. Under MUSE criterion, the weights and distribution parameters of these HMMs are estimated to minimize the number of different units between the selected phone sequences and the natural phone sequences for the training sentences. The optimization is realized by discriminative training using generalized probabilistic descent (GPD) algorithm. Results of our experiment show that this proposed method is able to improve the performance of the baseline system where model weights are set manually and distribution parameters are trained under maximum likelihood criterion.
Keywords :
hidden Markov models; maximum likelihood estimation; probability; speech synthesis; HMM; baseline system; discriminative training; generalized probabilistic descent algorithm; maximum likelihood criterion; minimum unit selection error training; phone sequences; speech database; training sentences; unit selection speech synthesis system; Automation; Concatenated codes; Context modeling; Costs; Databases; Hidden Markov models; Laboratories; Maximum likelihood estimation; Robustness; Speech synthesis; HMM; Speech synthesis; discriminative training; minimum unit selection error; unit selection;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518518