Title :
Automatic phrase boundary labeling for Mandarin TTS corpus using context-dependent HMM
Author :
Yang, Chen-Yu ; Ling, Zhen-Hua ; Lu, Heng ; Guo, Wu ; Dai, Li-Rong
Author_Institution :
iFly Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
fDate :
Nov. 29 2010-Dec. 3 2010
Abstract :
In this paper, an automatic prosodic phrase boundary labeling method for speech synthesis database is presented. This method can be divided into two stages: training stage and labeling stage. In training stage, context-dependent HMM, which is commonly adopted in the HMM-based parametric speech synthesis, is estimated using the training database with manual prosodic labeling. In labeling stage, the maximum likelihood criterion derived from the trained HMMs and the exhaustive search method are employed to find the optimal phrase boundary positions for an input sentence based on its acoustic features. The experimental results show that an F-score of 76.46% can be achieved for the prosodic phrase boundary detection of our Mandarin TTS corpus, which is close to the accuracy of experienced human labelers.
Keywords :
hidden Markov models; speech synthesis; Mandarin TTS corpus; acoustic features; automatic phrase boundary labeling; context dependent HMM; hidden Markov model; manual prosodic labeling; maximum likelihood criterion; speech synthesis database; Acoustics; Context; Hidden Markov models; Labeling; Speech; Speech synthesis; Training; automatic labeling; hidden Markov model; prosodic phrase boundary; speech synthesis;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
DOI :
10.1109/ISCSLP.2010.5684864