Title :
Modulation spectrum-constrained trajectory training algorithm for GMM-based Voice Conversion
Author :
Takamichi, Shinnosuke ; Toda, Tomoki ; Black, Alan W. ; Nakamura, Satoshi
Author_Institution :
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol. (NAIST), Nara, Japan
Abstract :
This paper presents a novel training algorithm for Gaussian Mixture Model (GMM)-based Voice Conversion (VC). One of the advantages of GMM-based VC is computationally efficient conversion processing enabling to achieve real-time VC applications. On the other hand, the quality of the converted speech is still significantly worse than that of natural speech. In order to address this problem while preserving the computationally efficient conversion processing, the proposed training method enables 1) to use a consistent optimization criterion between training and conversion and 2) to compensate a Modulation Spectrum (MS) of the converted parameter trajectory as a feature sensitively correlated with over-smoothing effects causing quality degradation of the converted speech. The experimental results demonstrate that the proposed algorithm yields significant improvements in term of both the converted speech quality and the conversion accuracy for speaker individuality compared to the basic training algorithm.
Keywords :
Gaussian processes; mixture models; speaker recognition; speech processing; GMM-based voice conversion; Gaussian mixture model; consistent optimization criterion; conversion accuracy improvement; converted parameter trajectory; converted speech quality improvement; modulation spectrum compensation; modulation spectrum-constrained trajectory training algorithm; over-smoothing effects; quality degradation; speaker individuality; Gold; Hafnium; Pragmatics; Speech; Training; GMM-based voice conversion; modulation spectrum; over-smoothing; training algorithm;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178894