DocumentCode
454651
Title
Applying Pitch Target Model to Convert F0 Contour for Expressive Mandarin Speech Synthesis
Author
Kang, Yongguo ; Tao, Jianhua ; Xu, Bo
Author_Institution
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, ygkang@nlpr.ia.ac.cn
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
In this paper, the pitch target model is employed to represent and convert F0 contours in order to synthesize emotional Mandarin speech from neutral speech. Compared with conventional F0 transformation methods, the proposed method converts F0 patterns described by pitch target parameters rather than the F0 contours themselves, and uses Gaussian Mixture Model (GMM) and Classification and Regression Tree (CART) methods to build mapping functions for well-chosen pitch target parameters. Other prosodic parameters, such as duration and intensity, are also converted. Listening tests show that the converted speech expresses the corresponding emotional states.
Keywords
Automation; Classification tree analysis; Electronic switching systems; Laboratories; Paper technology; Pattern recognition; Regression tree analysis; Speech synthesis; Technological innovation; Vegetation mapping
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660125
Filename
1660125