DocumentCode :
1864129
Title :
Optimal residual frame based source modeling for HMM-based speech synthesis
Author :
Narendra, N.P. ; Sreenivasa Rao, K.
Author_Institution :
Sch. of Inf. Technol., Indian Inst. of Technol. Kharagpur, Kharagpur, India
fYear :
2015
fDate :
4-7 Jan. 2015
Firstpage :
1
Lastpage :
5
Abstract :
This paper proposes a method for modeling the excitation signal to improve the quality of HMM-based speech synthesis system (HTS). Single optimal residual frame which closely relates to all frames of phone is chosen to represent the entire residual signal of the phone. Optimal residual frames of all phones present in the speech corpus are efficiently grouped based on positional and contextual features to form a decision tree of clusters. During synthesis, suitable residual frames are picked up from the leaf of decision tree based on certain selection criteria. From each optimal residual frame, the entire residual signal corresponding to the phone is generated based on pitch and intensity contours obtained from HMMs. Subjective evaluation results show that the proposed excitation model can significantly improve the quality of HTS compared to traditional pulse excitation.
Keywords :
decision trees; hidden Markov models; speech synthesis; HMM-based speech synthesis system; HTS; cluster decision tree; contextual feature; hidden Markov model; intensity contour; optimal residual frame; pulse excitation; residual signal; source modeling; speech corpus; Databases; Decision trees; Feature extraction; Hidden Markov models; High-temperature superconductors; Speech; Speech synthesis; Excitation modeling; HMM-based speech synthesis; Hybrid synthesis; optimal residual frame;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on
Conference_Location :
Kolkata
Type :
conf
DOI :
10.1109/ICAPR.2015.7050668
Filename :
7050668
Link To Document :
بازگشت