DocumentCode :
3123932
Title :
An improved tone labeling and prediction method with non-uniform segmentation of F0 contour
Author :
Xingyu Na ; Xiang Xie ; Jingming Kuang ; Yaling He
Author_Institution :
Sch. of Inf. & Electron., Beijing Inst. of Technol., Beijing, China
fYear :
2012
fDate :
5-8 Dec. 2012
Firstpage :
252
Lastpage :
255
Abstract :
This paper proposes a tone labeling technique for tonal language speech synthesis. Non-uniform segmentation using Viterbi alignment is introduced to determine the boundaries to get F0 symbols, which are used as tonal label to eliminate the mismatch between tone patterns and F0 contours of training data. During context clustering, the tendency of adjacent F0 state distributions are captured by the state-based phonetic trees. Means of tone model states are directly quantized to get full tonal label in the synthesis stage. Both objective and subjective experiment results show that the proposed technique can improve the perceptual prosody of synthetic speech of non-professional speakers.
Keywords :
speech processing; speech synthesis; statistical analysis; F0 contour; F0 state distribution; Viterbi alignment; context clustering; nonprofessional speaker; nonuniform segmentation; perceptual prosody; prediction method; state-based phonetic trees; synthetic speech; tonal language speech synthesis; tone labeling; tone pattern; Context; Hidden Markov models; Labeling; Speech; Speech synthesis; Training; Viterbi algorithm; F0 generation; F0 modeling; Statistical speech synthesis; Tone labeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location :
Kowloon
Print_ISBN :
978-1-4673-2506-6
Electronic_ISBN :
978-1-4673-2505-9
Type :
conf
DOI :
10.1109/ISCSLP.2012.6423467
Filename :
6423467
Link To Document :
بازگشت