DocumentCode :
1615807
Title :
HMM based speech synthesis with Global Variance Training method
Author :
Tao, Jianhua
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
fYear :
2010
Firstpage :
47
Lastpage :
47
Abstract :
Although Hidden Markov Model based speech synthesis has been proved to have good performance,there are still some factors which degrade the quality of synthesized speech: vocoder,model accuracy and over-smoothing. Experimental results show that over-smoothing in frequency domain mainly affect the quality of synthesized speech whereas over-smoothing in time domain can nearly be ignored. Time domain over-smoothing is generally caused by model structure accuracy problem and frequency domain over-smoothing is caused by training algorithm accuracy problem. ML-estimation based parameter training algorithm causes distortion of perception in speech synthesis. The talk will introduce a Global Variance (FV) based Training method into the HTS training structure. The new method tries to enlarge the variance of the spectrum and FO generation. The experiments show that the method improves the synthesizing performance both in voice quality and the expressiveness.
Keywords :
hidden Markov models; learning (artificial intelligence); speech processing; speech synthesis; vocoders; HMM based speech synthesis; HTS training structure; ML estimation based parameter training algorithm; frequency domain over smoothing; global variance based training method; global variance training method; hidden Markov model based speech synthesis; model accuracy; model structure accuracy problem; synthesized speech quality; time domain over smoothing; training algorithm accuracy problem; vocoder; voice quality;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Universal Communication Symposium (IUCS), 2010 4th International
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-7821-7
Type :
conf
DOI :
10.1109/IUCS.2010.5666649
Filename :
5666649
Link To Document :
بازگشت