DocumentCode
591768
Title
Effective sentence selection based on phone/model coverage maximization for speaker adaptation in HMM-based speech synthesis
Author
Cheng Hsien Lin ; Po Kai Huang ; Cheng Yuan Lin ; Chih Chung Kuo
Author_Institution
ITRI, Hsinchu, Taiwan
fYear
2012
fDate
5-8 Dec. 2012
Firstpage
74
Lastpage
78
Abstract
Reducing the recording effort required in practical speaker-adaptive text-to-speech applications would be very useful. In this paper, we present two sentence selection approaches based on a greedy algorithm: one is based on phone coverage and the other on model coverage. The former considers the phonetic information in the speaker adaptation data, while the latter focuses on occurrences of Mel-cepstral and logF0 models in the decision trees of the average voice model. To verify the efficacy of the proposed methods, we compare their performance with that of a random selection method in objective and subjective evaluations. The results of both evaluations show that both methods outperform the random selection method.
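The abstract does not give the scoring function used by the greedy algorithm, so the following is only a minimal sketch of generic greedy coverage maximization, not the authors' method. The function name `greedy_select`, the `budget` parameter, and the toy corpus are hypothetical. Each sentence is represented as a list of units: phone labels for phone coverage, or (as an assumption) identifiers of decision-tree leaf models for model coverage.

```python
from typing import Dict, List, Set


def greedy_select(sentences: Dict[str, List[str]], budget: int) -> List[str]:
    """Pick up to `budget` sentence ids, each time taking the sentence
    that adds the most not-yet-covered units (hypothetical criterion)."""
    covered: Set[str] = set()
    chosen: List[str] = []
    remaining = dict(sentences)
    while remaining and len(chosen) < budget:
        # Gain = number of new units the sentence would contribute.
        # Iterating over sorted ids makes tie-breaking deterministic.
        best_id = max(
            sorted(remaining),
            key=lambda sid: len(set(remaining[sid]) - covered),
        )
        if not set(remaining[best_id]) - covered:
            break  # no remaining sentence adds coverage
        chosen.append(best_id)
        covered |= set(remaining.pop(best_id))
    return chosen


# Toy corpus (invented for illustration): sentence id -> units it contains.
corpus = {
    "s1": ["a", "b", "c"],
    "s2": ["a", "b"],
    "s3": ["c", "d", "e", "f"],
    "s4": ["a", "g"],
}
selected = greedy_select(corpus, budget=3)
print(selected)
```

A random-selection baseline, as used for comparison in the paper, would simply draw `budget` sentence ids without looking at coverage.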
Keywords
hidden Markov models; speech synthesis; HMM-based speech synthesis; Mel-cepstral models; logF0 models; model coverage; phone coverage; phone-model coverage maximization; random selection method; sentence selection; speaker adaptation data; speaker adaptive text-to-speech applications; Adaptation models; Data models; Greedy algorithms; Speech; Training; greedy algorithm; speaker adaptation
fLanguage
English
Publisher
IEEE
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on
Conference_Location
Kowloon
Print_ISBN
978-1-4673-2506-6
Electronic_ISBN
978-1-4673-2505-9
Type
conf
DOI
10.1109/ISCSLP.2012.6423469
Filename
6423469