• DocumentCode
    2093457
  • Title

    Voive conversion based on a statistical model

  • Author

    Bi, QingGang ; Zhang, Linghua

  • Author_Institution
    Coll. of Telecommun. & Inf. Eng., Nanjing Univ. of Post & Telecommun., Nanjing, China
  • fYear
    2010
  • fDate
    11-14 Nov. 2010
  • Firstpage
    1414
  • Lastpage
    1417
  • Abstract
    In this paper, we propose to use a voice conversion method based on transformation of the characteristic features of a source speaker towards a target. The main objective of the work involves building a nonlinear relationship between parameters for the acoustical features of two speakers, based on a probabilistic model. The conversion rules involve the probabilistic classification and a cross correlation probability between the acoustic features of the two speakers. The parameters of the conversion rules are estimated by estimating the maximum likelihood of the training data. A comparative study of voice conversion with the proposed method and conventional vector quantization (VQ) is conducted. The experimental results of voice conversion evaluated using subjective and objective measures indicated that the performance can be dramatically improved by the proposed method.
  • Keywords
    maximum likelihood estimation; probability; speech processing; vector quantisation; VQ; acoustical features; conventional vector quantization; cross-correlation probability; maximum likelihood estimation; objective measures; probabilistic classification; probabilistic model; source speaker characteristic features; statistical model; subjective measures; voice conversion method; Hidden Markov models; Yttrium; Maximum likelihood (ML) estimation; VQ; voice conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication Technology (ICCT), 2010 12th IEEE International Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4244-6868-3
  • Type

    conf

  • DOI
    10.1109/ICCT.2010.5689014
  • Filename
    5689014