DocumentCode
972255
Title
Unit-Centric Feature Mapping for Inventory Pruning in Unit Selection Text-to-Speech Synthesis
Author
Bellegarda, Jerome R.
Author_Institution
Speech & Language Technol., Apple, Inc., Cupertino, CA
Volume
16
Issue
1
fYear
2008
Firstpage
74
Lastpage
82
Abstract
The level of quality that can be attained in concatenative text-to-speech (TTS) synthesis is primarily governed by the inventory of units used in unit selection. This has led to the collection of ever larger corpora in the quest for ever more natural synthetic speech. As operational considerations limit the size of the unit inventory, however, pruning is critical to removing any instances that prove either spurious or superfluous. This paper proposes a novel pruning strategy based on a data-driven feature extraction framework separately optimized for each unit type in the inventory. A single distinctiveness/redundancy measure can then address, in a consistent manner, the two different problems of outliers and redundant units. Detailed analysis of an illustrative case study exemplifies the typical behavior of the resulting unit pruning procedure, and listening evidence suggests that both moderate and aggressive inventory pruning can be achieved with minimal degradation in perceived TTS quality. These experiments underscore the benefits of unit-centric feature mapping for database optimization in concatenative synthesis.
Keywords
feature extraction; speech synthesis; TTS synthesis; data-driven feature extraction framework; distinctiveness measure; inventory pruning; redundancy measure; unit selection text-to-speech synthesis; unit-centric feature mapping; Assembly; Concatenated codes; Cost function; Degradation; Feature extraction; Loudspeakers; Natural languages; Spatial databases; Spectral shape; Speech synthesis; Concatenative speech synthesis; inventory pruning; outlier removal; unit redundancy perception; unit selection
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2007.911059
Filename
4381231