Title :
Spectral Dynamics as a Source of Discontinuity in Concatenative Speech Synthesis
Author :
Kirkpatrick, Barry ; O´Brien, Darragh ; Scaife, Ronán ; Errity, Andrew
Author_Institution :
Dublin City Univ., Dublin
Abstract :
The quality of concatenative speech synthesis depends on the cost function employed for unit selection. Effective cost functions for spectral continuity have proven difficult to define and standard measures do not accurately reflect human perception of spectral discontinuity in concatenated speech. Previous studies on spectral join costs have focused predominantly on static spectral measures extracted from the unit boundary. In this paper spectral dynamic behaviour is investigated as a source of discontinuity in concatenated speech. A number of measures representing spectral dynamics are tested for the task of detecting discontinuities. The spectral dynamic measures tested contain information correlating with human perception of discontinuities, suggesting that spectral dynamics are a source of discontinuity in concatenated speech. A strategy to effectively combine dynamic and static measures is proposed using principal component analysis (PCA).
Keywords :
principal component analysis; spectral analysis; speech synthesis; concatenative speech synthesis; cost functions; human perception; principal component analysis; spectral continuity; spectral dynamic behaviour; spectral dynamics; static spectral measures; Character generation; Concatenated codes; Cost function; Databases; Humans; Linear predictive coding; Principal component analysis; Probability density function; Speech synthesis; Testing; Concatenative speech synthesis; auditory perception; feature extraction; join cost; spectral dynamics;
Conference_Titel :
Digital Signal Processing, 2007 15th International Conference on
Conference_Location :
Cardiff
Print_ISBN :
1-4244-0882-2
Electronic_ISBN :
1-4244-0882-2
DOI :
10.1109/ICDSP.2007.4288657