• DocumentCode
    134270
  • Title
    Evaluation of parameter generation using high order dynamic features and long span windows for HMM based speech synthesis
  • Author
    Yang Wang; Jianhua Tao

  • Author_Institution
    National Laboratory of Pattern Recognition, Institute of Automation, Beijing, China
  • fYear
    2014
  • fDate
    12-14 Sept. 2014
  • Firstpage
    516
  • Lastpage
    520
  • Abstract
    The essence of speech parameter generation from HMMs using dynamic features is to take full advantage of the equation constraints between static and dynamic features, which suppress the stepwise parameter sequence formed by consecutive mean vectors and force the generated sequence to be smooth. These constraints have been shown to be useful; however, the number of constraints and their concrete values have seldom been investigated systematically. This paper experimentally evaluates many possible forms of high-order dynamic features and long-span windows. Objective and subjective experiments show that adding third-order dynamics to the conventional configuration improves performance for the evaluated male speaker. Moreover, adding more high-order dynamics reduces the unvoiced/voiced decision error rate, while using only first-order dynamics minimizes the reconstruction error on both spectrum and fundamental frequency.
  • Keywords
    hidden Markov models; speech synthesis; HMM based speech synthesis; consecutive mean vectors; evaluated male speaker; experimental evaluation; fundamental frequency; high order dynamic features; high order dynamics; long span windows; reconstruction error; spectrum; speech parameter generation; stepwise parameter sequence; unvoiced-voiced decision error rate; Equations; Error analysis; Mathematical model; Speech; Vectors; higher order dynamic features; long span window; parameter generation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
  • Conference_Location
    Singapore
  • Type
    conf
  • DOI
    10.1109/ISCSLP.2014.6936663
  • Filename
    6936663
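
The constraint idea summarized in the abstract can be illustrated with a minimal sketch of parameter generation using dynamic features. This is not the authors' implementation: the function names, the toy step-shaped means, the unit variances, and the conventional first- and second-order window coefficients are all assumptions chosen for illustration. The abstract does not give the coefficients of the higher-order or long-span windows it evaluates, so none are guessed here; such a window would simply be one more entry in the list. The sketch solves the weighted least-squares system (W' P W) c = W' P mu, where W stacks the windows, P holds the inverse variances, and mu holds the per-frame means.

```python
import numpy as np

def build_window_matrix(num_frames, windows):
    """Stack one banded T x T block per window into W of shape (K*T, T)."""
    blocks = []
    for win in windows:
        half = len(win) // 2
        block = np.zeros((num_frames, num_frames))
        for t in range(num_frames):
            for k, coeff in enumerate(win):
                # Clamp indices at the sequence edges (an assumed boundary rule).
                tau = min(max(t + k - half, 0), num_frames - 1)
                block[t, tau] += coeff
        blocks.append(block)
    return np.vstack(blocks)

def generate_trajectory(means, variances, windows):
    """Solve (W' P W) c = W' P mu for the static trajectory c.

    means, variances: arrays of shape (K, T), one row per window,
    with the static row first.
    """
    num_frames = means.shape[1]
    W = build_window_matrix(num_frames, windows)
    mu = means.reshape(-1)                   # row order matches the stacked blocks
    precision = 1.0 / variances.reshape(-1)  # diagonal covariance assumed
    A = W.T @ (precision[:, None] * W)
    b = W.T @ (precision * mu)
    return np.linalg.solve(A, b)

# Assumed windows: static, first-order delta, second-order delta.
WINDOWS = [
    np.array([1.0]),
    np.array([-0.5, 0.0, 0.5]),
    np.array([1.0, -2.0, 1.0]),
]

# Toy data: static means form a step, as consecutive state means would.
static_means = np.array([0.0] * 5 + [1.0] * 5)
means = np.vstack([static_means, np.zeros(10), np.zeros(10)])
variances = np.ones_like(means)

# Static window only: the stepwise mean sequence is returned unchanged.
stepwise = generate_trajectory(means[:1], variances[:1], WINDOWS[:1])
# With dynamic windows: the constraints smooth the step.
trajectory = generate_trajectory(means, variances, WINDOWS)
```

With only the static window, W is the identity and the output is the stepwise mean sequence itself; adding dynamic windows couples neighboring frames, which is what forces the smoother trajectory.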