Title :
Voice Conversion based on GMM and Artificial Neural Network
Author :
Peng, Danwen ; Zhang, Xiongwei ; Sun, Jian
Author_Institution :
Inst. of Commun. Eng., PLA Univ. of Sci. & Tech., Nanjing, China
Abstract :
Voice Conversion (VC) technique allows to transform the voice of the source speaker so that it is perceived as uttered by the target speaker. In this paper, a novel VC method combining Gaussian Mixture Model (GMM) and Artificial Neural Network is proposed. To overcome the over-smoothing problem of GMM-based mapping method, we propose to convert the basic spectral envelope by GMM method and the residual envelope by ANN method. Compared with the traditional GMM based method, the proposed method can effectively improve the quality and naturalness of the converted speech. Experimental results using both objective tests and listening tests show the superiority of the new method.
Keywords :
Gaussian processes; neural nets; speech processing; voice communication; GMM-based mapping method; Gaussian mixture model; artificial neural network; source speaker; target speaker; voice conversion; voice transformation; Artificial neural networks; Smoothing methods; Variable speed drives;
Conference_Titel :
Communication Technology (ICCT), 2010 12th IEEE International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-6868-3
DOI :
10.1109/ICCT.2010.5688637