DocumentCode :
2175434
Title :
Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis
Author :
Pan, Shifeng ; Nankaku, Yoshihiko ; Tokuda, Keiichi ; Tao, Jianhua
Author_Institution :
Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing, China
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4716
Lastpage :
4719
Abstract :
The speech parameter generation algorithm considering global variance (GV) for HMM-based speech synthesis proved to be effective against the over-smoothing problem. However, the correlation between dimensions of parameter vector is not sufficiently considered in the current GV model. For some parameters, e.g., Line Spectral Pairs (LSP), the difference of adjacent LSPs has strong influence on the spectral envelope. Considering this important feature, the paper proposes a GV modeling on the difference of adjacent LSPs, i.e., GV on frequency domain delta LSP. By improving the GV likelihood on frequency domain delta LSP, the over-smoothing effect of generated parameter trajectory is better alleviated than conventional one. The result of a perceptual evaluation shows the proposed method outperforms the conventional one, and the naturalness of synthetic speech is improved.
Keywords :
hidden Markov models; smoothing methods; speech synthesis; GV likelihood; GV modeling; HMM-based speech synthesis; frequency domain delta LSP; global variance modeling; line spectral pairs; over-smoothing effect; speech parameter generation algorithm; Correlation; Frequency domain analysis; Hidden Markov models; Speech; Speech synthesis; Training; Trajectory; global variance; hidden Markov model; speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947408
Filename :
5947408
Link To Document :
بازگشت