DocumentCode
2065587
Title
Modeling and Generating Tone Contour with Phrase Intonation for Mandarin Chinese Speech
Author
Wu, Zhizheng ; Qian, Yao ; Soong, Frank K. ; Zhang, Bo
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
This paper models F0 curves with discrete cosine transform (DCT) representations on both syllable-level tone and phrase-level intonation for Chinese Mandarin speech. Decision trees growing with maximum likelihood (ML) and stopping with minimum description length (MDL) are used to cluster very rich context-dependent DCT models into generalized ones to predict unseen contexts in test robustly. Additionally, we propose to generate Mandarin tone contours by jointly optimizing FO contours of syllable and phrase in ML sense. Experimental results on speaker-dependent continuous and speaker-independent isolated speech corpora show that the proposed approach can be able to generate FO contour with high correlation coefficients of 0.92 and 0.82 respectively, measured between the original and generated F0.
Keywords
decision trees; discrete cosine transforms; maximum likelihood estimation; natural language processing; speech processing; F0 curve model; Mandarin Chinese speech; correlation coefficients; decision trees; discrete cosine transform representation; maximum likelihood estimation; minimum description length; phrase-level intonation; speaker-dependent continuou speech corpora; speaker-independent isolated speech corpora; syllable-level tone; tone contour modeling; Asia; Discrete cosine transforms; Educational institutions; Fluctuations; Loudspeakers; Natural languages; Parametric statistics; Predictive models; Speech; Spline;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2942-4
Electronic_ISBN
978-1-4244-2943-1
Type
conf
DOI
10.1109/CHINSL.2008.ECP.42
Filename
4730296
Link To Document