DocumentCode :
2801335
Title :
Word-level emphasis modelling in HMM-based speech synthesis
Author :
Yu, K. ; Mairesse, F. ; Young, S.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge, UK
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
4238
Lastpage :
4241
Abstract :
Expressive speech synthesis has recently attracted great interest. Word-level emphasis is an important form of expressiveness to distinguish between what is the focus of the utterance, and what the computer system expects to be known by the user. Previous work on emphasis synthesis requires emphatic data collected specifically for that task. In this paper, a statistical approach that models and extracts word-level emphasis patterns from natural speech is investigated within the HMM based speech synthesis framework. Compared to emphatic speech collected specifically for this task, the cues of emphasis in natural speech are weaker and heavily affected by various suprasegmental features. Two new decision tree clustering approaches, two-pass and factorized decision tree, are proposed to effectively address this problem. Experiments show that both approaches can convey emphasis significantly better than traditional decision tree clustering and HMM adaptation. While the two-pass decision tree approach outperformed the factorized decision tree approach in an emphasis synthesis test, the latter led to significantly better naturalness and hence achieved a better overall balance.
Keywords :
decision trees; hidden Markov models; pattern clustering; speech synthesis; word processing; HMM-based speech synthesis; decision tree clustering; factorized decision tree; natural speech; word-level emphasis modelling; Control system synthesis; Data mining; Decision trees; Grounding; Hidden Markov models; High temperature superconductors; Intelligent systems; Natural languages; Speech synthesis; Testing; HMM based synthesis; decision tree; expressive speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495690
Filename :
5495690
Link To Document :
بازگشت