DocumentCode
1853010
Title
Modular Global Variance enhancement for voice conversion systems
Author
Benisty, H. ; Malah, D. ; Crammer, K.
Author_Institution
Dept. of Electr. Eng., Technion - Israel Inst. of Technol., Haifa, Israel
fYear
2012
fDate
27-31 Aug. 2012
Firstpage
370
Lastpage
374
Abstract
Voice conversion systems aim to transform sentences said by one speaker, to sound as if another speaker had said them. Many statistically trained conversion methods produce muffled synthesized outputs due to over-smoothing of the converted spectra. To deal with the muffling effect, conversion methods integrated with Global Variance (GV) enhancement, have been proposed. In order to gain the benefits of GV enhancement, the user is restricted to apply one of these methods as a conversion method. We propose a new GV enhancement method designed independently of any specific conversion scheme and applied as a post-processing block. The extent of GV enhancement is controlled through the allowed spectral distance between the enhanced and the originally converted output, as specified by the user. Listening tests showed that the proposed method improves both quality and similarity to the target of the examined converted sentences, outperforming other enhancement approaches that we evaluated.
Keywords
Gaussian processes; pattern matching; speech enhancement; GV enhancement method; global variance enhancement; modular global variance enhancement; muffling effect; post-processing block; spectral distance; statistically trained conversion methods; voice conversion systems; Matrix converters; Speech; Standards; Training; Vectors; Gaussian Mixture Model (GMM); Global Variance (GV); Log Spectral Distance (LSD); Voice Conversion;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location
Bucharest
ISSN
2219-5491
Print_ISBN
978-1-4673-1068-0
Type
conf
Filename
6334102
Link To Document