• DocumentCode
    1357566
  • Title

    Voice Conversion Based on Weighted Frequency Warping

  • Author

    Erro, Daniel ; Moreno, Asunción ; Bonafonte, Antonio

  • Author_Institution
    TALP Res. Center, Univ. Politec. de Catalunya (UPC), Barcelona, Spain
  • Volume
    18
  • Issue
    5
  • fYear
    2010
  • fDate
    7/1/2010 12:00:00 AM
  • Firstpage
    922
  • Lastpage
    931
  • Abstract
    Any modification applied to speech signals has an impact on their perceptual quality. In particular, voice conversion to modify a source voice so that it is perceived as a specific target voice involves prosodic and spectral transformations that produce significant quality degradation. Choosing among the current voice conversion methods represents a trade-off between the similarity of the converted voice to the target voice and the quality of the resulting converted speech, both rated by listeners. This paper presents a new voice conversion method termed Weighted Frequency Warping that has a good balance between similarity and quality. This method uses a time-varying piecewise-linear frequency warping function and an energy correction filter, and it combines typical probabilistic techniques and frequency warping transformations. Compared to standard probabilistic systems, Weighted Frequency Warping results in a significant increase in quality scores, whereas the conversion scores remain almost unaltered. This paper carefully discusses the theoretical aspects of the method and the details of its implementation, and the results of an international evaluation of the new system are also included.
  • Keywords
    piecewise linear techniques; probability; speech synthesis; energy correction filter; piecewise linear function; probabilistic techniques; prosodic transformations; spectral transformations; time-varying function; voice conversion; weighted frequency warping; Gaussian mixture models (GMMs); harmonic plus stochastic model (HSM); speech synthesis; voice conversion; weighted frequency warping;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2009.2038663
  • Filename
    5353707