Voice Conversion based on GMM and Artificial Neural Network

Author

Peng, Danwen ; Zhang, Xiongwei ; Sun, Jian

Author_Institution

Inst. of Commun. Eng., PLA Univ. of Sci. & Tech., Nanjing, China

fYear

2010

fDate

11-14 Nov. 2010

Firstpage

1121

Lastpage

1124

Abstract

Voice Conversion (VC) technique allows to transform the voice of the source speaker so that it is perceived as uttered by the target speaker. In this paper, a novel VC method combining Gaussian Mixture Model (GMM) and Artificial Neural Network is proposed. To overcome the over-smoothing problem of GMM-based mapping method, we propose to convert the basic spectral envelope by GMM method and the residual envelope by ANN method. Compared with the traditional GMM based method, the proposed method can effectively improve the quality and naturalness of the converted speech. Experimental results using both objective tests and listening tests show the superiority of the new method.

Keywords

Gaussian processes; neural nets; speech processing; voice communication; GMM-based mapping method; Gaussian mixture model; artificial neural network; source speaker; target speaker; voice conversion; voice transformation; Artificial neural networks; Smoothing methods; Variable speed drives;

fLanguage

English

Publisher

ieee

Conference_Titel

Communication Technology (ICCT), 2010 12th IEEE International Conference on

Conference_Location

Nanjing

Print_ISBN

978-1-4244-6868-3

Type

conf

DOI

10.1109/ICCT.2010.5688637

Filename

5688637

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2084658