• DocumentCode
    1562886
  • Title

    Genetic algorithm based RBF neural network for voice conversion

  • Author

    Guoyu Zuo ; Wenju Liu

  • Author_Institution
    Inst. of Autom., Chinese Acad. of Sci., Beijing, China
  • Volume
    5
  • fYear
    2004
  • Firstpage
    4215
  • Abstract
    Voice conversion is a technique used to transform one speaker´s voice into another speaker´s voice. This paper describes a study on voice conversion using genetic algorithm (GA) to train the hidden layer of RBF neural network, which helps to improve the preference of the converted speech for the target speaker´s characteristics. Six mono-vowel phonemes in Mandarin speech are used for the conversion experiments which are performed on neural networks, respectively, by GA-based and K-means methods. Subjective and objective evaluations are conducted on the performances of converted speech. The conversion results show that in spite of not much improvement in perceptual distance, the RBF network by genetic algorithm instead of by K-means method has the ability of global optimization with an evident decrease in the spectral distance between the converted speech and the target speech.
  • Keywords
    genetic algorithms; learning (artificial intelligence); pattern clustering; radial basis function networks; speech processing; GA based methods; K-means methods; Mandarin speech; RBF neural network; genetic algorithm; global optimization; monovowel phonemes; speaker voice conversion; Artificial neural networks; Clustering algorithms; Genetic algorithms; Hidden Markov models; Loudspeakers; Neural networks; Optimization methods; Radial basis function networks; Speech; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control and Automation, 2004. WCICA 2004. Fifth World Congress on
  • Print_ISBN
    0-7803-8273-0
  • Type

    conf

  • DOI
    10.1109/WCICA.2004.1342304
  • Filename
    1342304