• DocumentCode
    1853010
  • Title

    Modular Global Variance enhancement for voice conversion systems

  • Author

    Benisty, H. ; Malah, D. ; Crammer, K.

  • Author_Institution
    Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel
  • fYear
    2012
  • fDate
    27-31 Aug. 2012
  • Firstpage
    370
  • Lastpage
    374
  • Abstract
    Voice conversion systems aim to transform sentences said by one speaker, to sound as if another speaker had said them. Many statistically trained conversion methods produce muffled synthesized outputs due to over-smoothing of the converted spectra. To deal with the muffling effect, conversion methods integrated with Global Variance (GV) enhancement, have been proposed. In order to gain the benefits of GV enhancement, the user is restricted to apply one of these methods as a conversion method. We propose a new GV enhancement method designed independently of any specific conversion scheme and applied as a post-processing block. The extent of GV enhancement is controlled through the allowed spectral distance between the enhanced and the originally converted output, as specified by the user. Listening tests showed that the proposed method improves both quality and similarity to the target of the examined converted sentences, outperforming other enhancement approaches that we evaluated.
  • Keywords
    Gaussian processes; pattern matching; speech enhancement; GV enhancement method; global variance enhancement; modular global variance enhancement; muffling effect; post-processing block; spectral distance; statistically trained conversion methods; voice conversion systems; Matrix converters; Speech; Standards; Training; Vectors; Gaussian Mixture Model (GMM); Global Variance (GV); Log Spectral Distance (LSD); Voice Conversion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
  • Conference_Location
    Bucharest
  • ISSN
    2219-5491
  • Print_ISBN
    978-1-4673-1068-0
  • Type

    conf

  • Filename
    6334102