• DocumentCode
    2084658
  • Title

    Voice Conversion based on GMM and Artificial Neural Network

  • Author

    Peng, Danwen ; Zhang, Xiongwei ; Sun, Jian

  • Author_Institution
    Inst. of Commun. Eng., PLA Univ. of Sci. & Tech., Nanjing, China
  • fYear
    2010
  • fDate
    11-14 Nov. 2010
  • Firstpage
    1121
  • Lastpage
    1124
  • Abstract
    Voice Conversion (VC) technique allows to transform the voice of the source speaker so that it is perceived as uttered by the target speaker. In this paper, a novel VC method combining Gaussian Mixture Model (GMM) and Artificial Neural Network is proposed. To overcome the over-smoothing problem of GMM-based mapping method, we propose to convert the basic spectral envelope by GMM method and the residual envelope by ANN method. Compared with the traditional GMM based method, the proposed method can effectively improve the quality and naturalness of the converted speech. Experimental results using both objective tests and listening tests show the superiority of the new method.
  • Keywords
    Gaussian processes; neural nets; speech processing; voice communication; GMM-based mapping method; Gaussian mixture model; artificial neural network; source speaker; target speaker; voice conversion; voice transformation; Artificial neural networks; Smoothing methods; Variable speed drives;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication Technology (ICCT), 2010 12th IEEE International Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4244-6868-3
  • Type

    conf

  • DOI
    10.1109/ICCT.2010.5688637
  • Filename
    5688637