DocumentCode :
542240
Title :
Domain adaptation for TTS systems
Author :
Chu, Min ; Li, Chun ; Peng, Hu ; Chang, Eric
Author_Institution :
Microsoft Research Asia, Beijing, China
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
This paper puts forward a domain adaptation problem that has not been studied well. For corpus-driven TTS systems, domain adaptation is realized by adding a small amount of domain-specific speech that will provide the maximum increase in average length of units that are used for synthesizing speech in that domain. An approach for generating optimized script for adaptation is proposed, the core of which is a dynamic programming based algorithm that segments domain-specific corpus into minimum number of segments that appear in the unit inventory. Increase in MOS after adaptation can be estimated from the generated script without recording speech from it. The results show that the amount of MOS increase depends not only on the size of the training set and the size of the script for adaptation, but also on the broadness of the domain. Narrower domains have larger increase in MOS.
Keywords :
Data mining; Decision support systems; Indium tin oxide; Open systems; Speech; Synthesizers; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743752
Filename :
5743752
Link To Document :
بازگشت