Modular Global Variance enhancement for voice conversion systems

Author

Benisty, H. ; Malah, D. ; Crammer, K.

Author_Institution

Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel

fYear

2012

fDate

27-31 Aug. 2012

Firstpage

370

Lastpage

374

Abstract

Voice conversion systems aim to transform sentences said by one speaker, to sound as if another speaker had said them. Many statistically trained conversion methods produce muffled synthesized outputs due to over-smoothing of the converted spectra. To deal with the muffling effect, conversion methods integrated with Global Variance (GV) enhancement, have been proposed. In order to gain the benefits of GV enhancement, the user is restricted to apply one of these methods as a conversion method. We propose a new GV enhancement method designed independently of any specific conversion scheme and applied as a post-processing block. The extent of GV enhancement is controlled through the allowed spectral distance between the enhanced and the originally converted output, as specified by the user. Listening tests showed that the proposed method improves both quality and similarity to the target of the examined converted sentences, outperforming other enhancement approaches that we evaluated.

Keywords

Gaussian processes; pattern matching; speech enhancement; GV enhancement method; global variance enhancement; modular global variance enhancement; muffling effect; post-processing block; spectral distance; statistically trained conversion methods; voice conversion systems; Matrix converters; Speech; Standards; Training; Vectors; Gaussian Mixture Model (GMM); Global Variance (GV); Log Spectral Distance (LSD); Voice Conversion;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European

Conference_Location

Bucharest

ISSN

2219-5491

Print_ISBN

978-1-4673-1068-0

Type

conf

Filename

6334102