DocumentCode :
2084658
Title :
Voice Conversion based on GMM and Artificial Neural Network
Author :
Peng, Danwen ; Zhang, Xiongwei ; Sun, Jian
Author_Institution :
Inst. of Commun. Eng., PLA Univ. of Sci. & Tech., Nanjing, China
fYear :
2010
fDate :
11-14 Nov. 2010
Firstpage :
1121
Lastpage :
1124
Abstract :
Voice Conversion (VC) technique allows to transform the voice of the source speaker so that it is perceived as uttered by the target speaker. In this paper, a novel VC method combining Gaussian Mixture Model (GMM) and Artificial Neural Network is proposed. To overcome the over-smoothing problem of GMM-based mapping method, we propose to convert the basic spectral envelope by GMM method and the residual envelope by ANN method. Compared with the traditional GMM based method, the proposed method can effectively improve the quality and naturalness of the converted speech. Experimental results using both objective tests and listening tests show the superiority of the new method.
Keywords :
Gaussian processes; neural nets; speech processing; voice communication; GMM-based mapping method; Gaussian mixture model; artificial neural network; source speaker; target speaker; voice conversion; voice transformation; Artificial neural networks; Smoothing methods; Variable speed drives;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communication Technology (ICCT), 2010 12th IEEE International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-6868-3
Type :
conf
DOI :
10.1109/ICCT.2010.5688637
Filename :
5688637
Link To Document :
بازگشت