Title :
Unit database pruning based on the cost degradation criterion for concatenative speech synthesis
Author :
Nishizawa, Nobuyuki ; Kawai, Hisashi
Author_Institution :
KDDI R&D Labs. Inc., Saitama
fDate :
March 31 2008-April 4 2008
Abstract :
A novel method of unit database pruning for concatenative speech synthesis is proposed. The proposed method uses sums of the unit preference criterion, which are calculated from cost degradation from the optimal sequence, instead of the appearance frequencies of units, which is used in the conventional method. Therefore, the proposed method is an extension of the conventional method. Since not only the optimal units but also the other candidate units can be taken into account for pruning, unit databases can be pruned with less experimental speech synthesis. The results of a unit selection experiment on 4-hour pruned unit databases built from the original 10.6-hour database indicate that the amount of the experimental speech synthesis can be reduced to 25% of that required for the conventional method without loss of the quality of synthetic speech in terms of average cost.
Keywords :
audio databases; speech synthesis; concatenative speech synthesis; cost degradation criterion; unit database pruning; unit preference criterion; Cost function; Degradation; Frequency synthesizers; Laboratories; Large-scale systems; Research and development; Spatial databases; Speech synthesis; Statistical distributions; Viterbi algorithm; Speech synthesis; database pruning; preselection; unit selection;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518523