• DocumentCode
    1798677
  • Title

    An improved ANN method based on clustering optimization for voice conversion

  • Author

    Chen Xiantong ; Zhang Linghua

  • Author_Institution
    Coll. of Telecommun. & Inf. Eng., Nanjing Univ. of Posts & Telecommun., Nanjing, China
  • fYear
    2014
  • fDate
    7-9 July 2014
  • Firstpage
    464
  • Lastpage
    469
  • Abstract
    Artificial neural network is a commonly used conversion model in voice conversion system, in which RBF is known for its concise convergence and fast learning. Based on optimizing the centers of RBF network, this article presents a method of using K-means algorithm to cluster and form centers and PSO algorithm to optimize the clustering number to improve the property of RBF, thus to enhance the transformation of speech parameters. Firstly, STRAIGHT model is used to extract linear prediction coefficients and pitch frequencies. Then the parameters are sent to RBF network, K-means and PSO algorithms are used to optimize the centers of RBF network until the fitness value is lowest. Experiment shows that, this method not only eliminates the trouble of finding the best clustering number one-by-one, but also effectively improves the performance of neural network, and the converted speeches are closer to the target one.
  • Keywords
    particle swarm optimisation; pattern clustering; radial basis function networks; speech processing; ANN method; K-means algorithm; PSO algorithm; RBF network; STRAIGHT model; artificial neural network; clustering optimization; concise convergence; conversion model; converted speeches; fitness value; linear prediction coefficients; pitch frequencies; speech parameters; voice conversion system; Algorithm design and analysis; Clustering algorithms; Frequency conversion; Predictive models; Radial basis function networks; Speech; Training; K-means; PSO; RBF; STRAIGHT; voice conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2014 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4799-3902-2
  • Type

    conf

  • DOI
    10.1109/ICALIP.2014.7009837
  • Filename
    7009837