DocumentCode :
1853010
Title :
Modular Global Variance enhancement for voice conversion systems
Author :
Benisty, H. ; Malah, D. ; Crammer, K.
Author_Institution :
Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel
fYear :
2012
fDate :
27-31 Aug. 2012
Firstpage :
370
Lastpage :
374
Abstract :
Voice conversion systems aim to transform sentences said by one speaker, to sound as if another speaker had said them. Many statistically trained conversion methods produce muffled synthesized outputs due to over-smoothing of the converted spectra. To deal with the muffling effect, conversion methods integrated with Global Variance (GV) enhancement, have been proposed. In order to gain the benefits of GV enhancement, the user is restricted to apply one of these methods as a conversion method. We propose a new GV enhancement method designed independently of any specific conversion scheme and applied as a post-processing block. The extent of GV enhancement is controlled through the allowed spectral distance between the enhanced and the originally converted output, as specified by the user. Listening tests showed that the proposed method improves both quality and similarity to the target of the examined converted sentences, outperforming other enhancement approaches that we evaluated.
Keywords :
Gaussian processes; pattern matching; speech enhancement; GV enhancement method; global variance enhancement; modular global variance enhancement; muffling effect; post-processing block; spectral distance; statistically trained conversion methods; voice conversion systems; Matrix converters; Speech; Standards; Training; Vectors; Gaussian Mixture Model (GMM); Global Variance (GV); Log Spectral Distance (LSD); Voice Conversion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location :
Bucharest
ISSN :
2219-5491
Print_ISBN :
978-1-4673-1068-0
Type :
conf
Filename :
6334102
Link To Document :
بازگشت