DocumentCode
178941
Title
Complex cepstrum factorization for statistical parametric synthesis
Author
Maia, Ranniery ; Stylianou, Yannis
Author_Institution
Cambridge Res. Lab., Toshiba Res. Eur. Ltd., Cambridge, UK
fYear
2014
fDate
4-9 May 2014
Firstpage
3839
Lastpage
3843
Abstract
This paper presents a study on complex cepstrum-based speech factorization for acoustic modeling in statistical parametric synthesizers. The factorization is conducted assuming that both vocal tract resonance and glottal flow effect are fully represented by the complex cepstrum. We investigated four different forms to represent the complex cepstrum in the acoustic models and compared their performances in terms of objective measures between reconstructed and natural waveforms and final quality of the synthesized speech. According to experimental results, the all-pass/minimum-phase and real cepstrum/phase cepstrum decompositions are the best ones in terms of preserving the complex cepstrum information after the parameter generation process.
Keywords
cepstral analysis; matrix decomposition; speech synthesis; statistical analysis; acoustic modeling; all-pass/minimum-phase decompositions; complex cepstrum-based speech factorization; glottal flow effect; natural waveforms; objective measures; parameter generation process; real cepstrum/phase cepstrum decompositions; reconstructed waveforms; statistical parametric synthesis; vocal tract resonance; Cepstrum; Production; Speech; Speech synthesis; complex cepstrum; speech production models; statistical parametric speech synthesis
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854320
Filename
6854320