DocumentCode :
3343988
Title :
Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
Author :
Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
Volume :
2
fYear :
2001
fDate :
2001
Firstpage :
841
Abstract :
In the voice conversion algorithm based on the Gaussian mixture model (GMM) applied to STRAIGHT, the quality of converted speech is degraded because the converted spectrum is excessively smooth. We propose a GMM-based algorithm with dynamic frequency warping to avoid this over-smoothing. We also propose adding a weighted residual spectrum, defined as the difference between the GMM-based converted spectrum and the frequency-warped spectrum, to avoid degrading the conversion accuracy for speaker individuality. Evaluation experiments show that, with a properly weighted residual spectrum, the proposed method yields better converted speech quality than the GMM-based algorithm while matching its conversion accuracy for speaker individuality.
Keywords :
Gaussian processes; spectral analysis; speech processing; speech synthesis; GMM-based algorithm; Gaussian mixture model; STRAIGHT analysis-synthesis method; conversion-accuracy; dynamic frequency warping; speaker individuality; speech quality; voice conversion algorithm; weighted residual spectrum; Algorithm design and analysis; Databases; Degradation; Frequency conversion; Heuristic algorithms; Information science; Loudspeakers; Speech analysis; Speech synthesis; Vector quantization
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.941046
Filename :
941046