مرکز منطقه ای اطلاع رساني علوم و فناوري - Modeling prosody patterns for Chinese expressive text-to-speech synthesis

DocumentCode :

2009629

Title :

Modeling prosody patterns for Chinese expressive text-to-speech synthesis

Author :

Wu, Zhiyong ; Cai, Lianhong ; Meng, Helen M.

Author_Institution :

Tsinghua-CUHK Joint Res. Center for Media Sci., Tsinghua Univ., Shenzhen, China

fYear :

2010

fDate :

Nov. 29 2010-Dec. 3 2010

Firstpage :

148

Lastpage :

152

Abstract :

This paper proposes an approach for modeling the prosody patterns of the acoustic features for Chinese expressive text-to-speech (TTS) synthesis. Based on the observation that the speaker usually tends to put more emphasis on one particular syllable within a multi-syllabic prosodic word, we identify such syllable as the core syllable that can be derived from the semantic stress and tone information of the text prompt. We then classify the syllables in speech into four classes, based on their relations with the core syllable in a prosodic word. We analyze the contrastive (neutral versus expressive) speech recordings for each of four classes, and develop a perturbation model that takes into account the prosody pattern to transform neutral speech to expressive speech. Perceptual experiments on both neutral speech recordings and neutral TTS outputs involving 13 subjects indicate that the proposed approach can significantly enhance expressivity in synthesizing expressive speech.

Keywords :

natural language processing; speaker recognition; speech synthesis; text analysis; Chinese expressive text-to-speech synthesis; acoustic features; contrastive speech recordings; multisyllabic prosodic word; neutral speech; perturbation model; prosody patterns; semantic stress; speaker; text prompt; Acoustics; Hidden Markov models; Semantics; Speech; Speech synthesis; Stress; expressive text-to-speech (TTS); non-linear perturbaton model; prosody pattern;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on

Conference_Location :

Tainan

Print_ISBN :

978-1-4244-6244-5

Type :

conf

DOI :

10.1109/ISCSLP.2010.5684494

Filename :

5684494

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2009629