Title :
Evaluation of parameter generation using high order dynamic features and long span windows for HMM based speech synthesis
Author :
Yang Wang ; Jianhua Tao
Author_Institution :
Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
Abstract :
The essence of speech parameter generation from HMMs using dynamic features is to take full advantage of equation constraints between static and dynamic features, suppressing stepwise parameter sequence of consecutive mean vectors and forcing the generated sequence to be smooth. The equation constraints are demonstrated to be useful; however, the number of constraints and their concrete values are seldom investigated systematically and thoroughly. This paper considers many possible forms of high order dynamic features and long span windows by experimental evaluation. Objective and subjective experiments show that it is helpful to add the third order dynamics to the conventional configuration to achieve better performance for the evaluated male speaker. Moreover, more high order dynamics reduce unvoiced/voiced decision error rate, while just utilizing the first order dynamics minimizes reconstruction error on spectrum and fundamental frequency simultaneously.
Keywords :
hidden Markov models; speech synthesis; HMM based speech synthesis; consecutive mean vectors; evaluated male speaker; experimental evaluation; fundamental frequency; high order dynamic features; high order dynamics; long span windows; reconstruction error; spectrum; speech parameter generation; stepwise parameter sequence; unvoiced-voiced decision error rate; Equations; Error analysis; Hidden Markov models; Mathematical model; Speech; Speech synthesis; Vectors; higher order dynamic features; long span window; parameter generation;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
DOI :
10.1109/ISCSLP.2014.6936663