• DocumentCode
    3124411
  • Title

    Detection and emphatic realization of contrastive word pairs for expressive text-to-speech synthesis

  • Author

    Chunrong Li ; Zhiyong Wu ; Fanbo Meng ; Meng, Hsiang-Yun ; Lianhong Cai

  • Author_Institution
    Tsinghua-CUHK Joint Res. Center for Media Sci., Technol. & Syst., Tsinghua Univ., Shenzhen, China
  • fYear
    2012
  • fDate
    5-8 Dec. 2012
  • Firstpage
    93
  • Lastpage
    97
  • Abstract
    This paper addresses the problem of automatic detection of contrastive word pairs and their acoustic realization in emphasis for expressive text-to-speech (TTS) synthesis in English. Support vector machines (SVMs) have been used to automatically detect contrastive word pairs from lexical features, syntactic dependencies and semantic relations. A much better performance is achieved by adding accent ratio and word identity features. Hidden Markov model (HMM) based speech synthesis is then used to generate emphatic speeches by putting emphasis on the detected contrastive word pairs. Subjective experiments show that most of the listeners consider putting emphasis on contrastive word pairs is more acceptable than on non-contrastive word pairs. This indicates the importance of the accurate detection of contrastive word pairs.
  • Keywords
    hidden Markov models; speech synthesis; support vector machines; English; HMM; SVM; TTS; acoustic realization; contrastive word pairs; emphatic realization; expressive text-to-speech synthesis; hidden Markov model; noncontrastive word pairs; support vector machines; word identity features; Feature extraction; Hidden Markov models; Semantics; Speech; Speech synthesis; Support vector machines; Syntactics; contrast; expressive text-to-speech (TTS) synthesis; hidden Markov model (HMM) based speech synthesis; support vector machines (SVMs);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
  • Conference_Location
    Kowloon
  • Print_ISBN
    978-1-4673-2506-6
  • Electronic_ISBN
    978-1-4673-2505-9
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2012.6423493
  • Filename
    6423493