DocumentCode
2084658
Title
Voice Conversion based on GMM and Artificial Neural Network
Author
Peng, Danwen ; Zhang, Xiongwei ; Sun, Jian
Author_Institution
Inst. of Commun. Eng., PLA Univ. of Sci. & Tech., Nanjing, China
fYear
2010
fDate
11-14 Nov. 2010
Firstpage
1121
Lastpage
1124
Abstract
Voice conversion (VC) transforms the voice of a source speaker so that it is perceived as uttered by a target speaker. In this paper, a novel VC method combining a Gaussian Mixture Model (GMM) and an Artificial Neural Network (ANN) is proposed. To overcome the over-smoothing problem of the GMM-based mapping method, we propose converting the basic spectral envelope with the GMM method and the residual envelope with the ANN method. Compared with the traditional GMM-based method, the proposed method effectively improves the quality and naturalness of the converted speech. Experimental results from both objective tests and listening tests show the superiority of the new method.
Keywords
Gaussian processes; neural nets; speech processing; voice communication; GMM-based mapping method; Gaussian mixture model; artificial neural network; source speaker; target speaker; voice conversion; voice transformation; Artificial neural networks; Smoothing methods; Variable speed drives;
fLanguage
English
Publisher
ieee
Conference_Titel
Communication Technology (ICCT), 2010 12th IEEE International Conference on
Conference_Location
Nanjing
Print_ISBN
978-1-4244-6868-3
Type
conf
DOI
10.1109/ICCT.2010.5688637
Filename
5688637