DocumentCode :
1843428
Title :
Intonation and prosody conversion for expressive mandarin speech synthesis
Author :
Jing Zhu ; Yibiao Yu
Author_Institution :
Sch. of Electron. & Inf. Eng., Soochow Univ., Suzhou, China
Volume :
1
fYear :
2012
fDate :
21-25 Oct. 2012
Firstpage :
549
Lastpage :
552
Abstract :
Expressive speech synthesis has a wide variety of applications. Compared with general speech synthesis for Chinese, this paper focuses on prosody and intonation. Prosody is described from three aspects, accent, pause and speaking speed. Accent can be stressed by modifying fundamental frequency and amplitude. Pause is achieved by interpolating some frames which parameter value is zero. Speaking speed is controlled by copying or deleting some frames in specified location. Mandarin is a tonal language, so intonation is significant in the synthesis. There are four patterns of intonation, rising intonation, falling intonation, flat intonation and sinuate intonation. Use polynomial fitting function to model each intonation pattern. Apply the intonation model to convert one pattern to another. It can be seen from the experimental results, the proposed method can achieve a good quality on the conversion of tune and it can highly improve the naturalness of the speech.
Keywords :
natural language processing; polynomials; speech synthesis; Chinese; Mandarin; expressive mandarin speech synthesis; falling intonation; flat intonation; intonation conversion; polynomial fitting function; prosody conversion; sinuate intonation; speaking speed; tonal language; intonation; polynomial fitting; prosody; speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing (ICSP), 2012 IEEE 11th International Conference on
Conference_Location :
Beijing
ISSN :
2164-5221
Print_ISBN :
978-1-4673-2196-9
Type :
conf
DOI :
10.1109/ICoSP.2012.6491547
Filename :
6491547
Link To Document :
بازگشت