DocumentCode
1653201
Title
Analysis of stream-dependent tying structure for HMM-based speech synthesis
Author
Yu, Zhi-Peng ; Wu, Yi-Jian ; Zen, Heiga ; Nankaku, Yoshihiko ; Tokuda, Keiichi
Author_Institution
Nagoya Inst. of Technol., Nagoya
fYear
2008
Firstpage
655
Lastpage
658
Abstract
In the conventional HMM-based speech synthesis framework, spectral features are modeled in a single stream, and stream-dependent tree-based clustering is then applied to tie the model parameters. In this paper, we investigate several stream-dependent tying structures for spectral features by splitting the feature vector into several streams. One approach assigns each feature dimension to its own stream. Another splits the static and dynamic features into different streams. Although splitting spectral features into different streams ignores the correlation of context dependency between them, the number of model parameters can be optimized for each stream after stream-dependent clustering. Experimental results show that both splitting approaches improve the quality of synthesized speech. However, the quality of synthesized speech became worse when the two splitting approaches were combined.
Keywords
hidden Markov models; pattern clustering; speech synthesis; tree data structures; HMM-based speech synthesis; stream-dependent tree-based clustering; stream-dependent tying structure; Context modeling; Hidden Markov models; High temperature superconductors; Probability distribution; Speech analysis; Speech synthesis; Stress; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing, 2008. ICSP 2008. 9th International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-2178-7
Electronic_ISBN
978-1-4244-2179-4
Type
conf
DOI
10.1109/ICOSP.2008.4697216
Filename
4697216