• DocumentCode
    454651
  • Title

    Applying Pitch Target Model to Convert F0 Contour for Expressive Mandarin Speech Synthesis

  • Author

    Kang, Yongguo ; Tao, Jianhua ; Xu, Bo

  • Author_Institution
    National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, ygkang@nlpr.ia.ac.cn
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    In the paper, pitch target model is employed to represent and convert F0 contour for synthesizing an emotional Mandarin speech from a neutral speech. Compared with conventional F0 transforming methods, the proposed method converts F0 patterns described by pitch target parameters rather than F0 contours themselves, and uses Gaussian Mixture Model(GMM) and Classification and Regression Trees (CART) methods to build mapping functions for well-chosen pitch target parameters. Other prosodic parameters such as duration and intensity are also converted. Listening tests prove that these converted speeches express corresponding emotional states.
  • Keywords
    Automation; Classification tree analysis; Electronic switching systems; Laboratories; Paper technology; Pattern recognition; Regression tree analysis; Speech synthesis; Technological innovation; Vegetation mapping;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660125
  • Filename
    1660125