  • DocumentCode
    178941
  • Title
    Complex cepstrum factorization for statistical parametric synthesis
  • Author
    Maia, Ranniery ; Stylianou, Yannis
  • Author_Institution
    Cambridge Res. Lab., Toshiba Res. Eur. Ltd., Cambridge, UK
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    3839
  • Lastpage
    3843
  • Abstract
    This paper presents a study on complex cepstrum-based speech factorization for acoustic modeling in statistical parametric synthesizers. The factorization is conducted assuming that both vocal tract resonance and glottal flow effect are fully represented by the complex cepstrum. We investigated four different forms to represent the complex cepstrum in the acoustic models and compared their performances in terms of objective measures between reconstructed and natural waveforms and final quality of the synthesized speech. According to experimental results, the all-pass/minimum-phase and real cepstrum/phase cepstrum decompositions are the best ones in terms of preserving the complex cepstrum information after the parameter generation process.
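The two decompositions named in the abstract can be illustrated with a minimal sketch. It uses the common textbook definitions (real cepstrum as the even part of the complex cepstrum, phase cepstrum as the odd part, minimum-phase cepstrum obtained by folding the real cepstrum onto non-negative quefrencies), which are an assumption here and may differ in detail from the paper's formulation. The function name `split_cepstrum` and the symmetric array layout are hypothetical, chosen only for illustration.

```python
import numpy as np

def split_cepstrum(c):
    """Split a complex cepstrum into two alternative pairs of components.

    Assumption: `c` is a real-valued array of odd length 2N+1 holding the
    complex cepstrum at quefrencies -N..N, with quefrency 0 at the centre.
    """
    c = np.asarray(c, dtype=float)
    if c.size % 2 == 0:
        raise ValueError("expected an odd-length array centred on quefrency 0")
    n_max = c.size // 2
    flipped = c[::-1]  # c[-n]: the cepstrum mirrored around quefrency 0

    # Real cepstrum / phase cepstrum pair: even and odd parts of c.
    real_cep = 0.5 * (c + flipped)   # carries the log-magnitude spectrum
    phase_cep = 0.5 * (c - flipped)  # carries the unwrapped phase spectrum

    # Minimum-phase / all-pass pair: fold the real cepstrum onto n >= 0.
    causal_window = np.zeros_like(c)
    causal_window[n_max] = 1.0
    causal_window[n_max + 1:] = 2.0
    min_phase = causal_window * real_cep  # zero for negative quefrencies
    all_pass = c - min_phase              # odd-symmetric remainder

    return real_cep, phase_cep, min_phase, all_pass

# Toy cepstrum on quefrencies -2..2 (made-up values, not data from the paper).
c = np.array([0.1, -0.2, 0.5, 0.3, 0.05])
real_cep, phase_cep, min_phase, all_pass = split_cepstrum(c)
```

In both pairs the components sum back to the original cepstrum, so each pair carries the same information in a different form.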
  • Keywords
    cepstral analysis; matrix decomposition; speech synthesis; statistical analysis; acoustic modeling; all-pass-minimum-phase decompositions; complex cepstrum-based speech factorization; glottal flow effect; natural waveforms; objective measures; parameter generation process; real cepstrum-phase cepstrum decompositions; reconstructed waveforms; statistical parametric synthesis; vocal tract resonance; Cepstrum; Production; Speech; complex cepstrum; speech production models; statistical parametric speech synthesis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type
    conf
  • DOI
    10.1109/ICASSP.2014.6854320
  • Filename
    6854320