DocumentCode :
1454404
Title :
Statistical Voice Conversion Based on Noisy Channel Model
Author :
Saito, Daisuke ; Watanabe, Shinji ; Nakamura, Atsushi ; Minematsu, Nobuaki
Author_Institution :
Dept. of Electr. Eng. & Inf. Syst., Univ. of Tokyo, Tokyo, Japan
Volume :
20
Issue :
6
fYear :
2012
Firstpage :
1784
Lastpage :
1794
Abstract :
This paper describes a novel framework of voice conversion effectively using both a joint density model and a speaker model. In voice conversion studies, approaches based on the Gaussian mixture model (GMM) with probabilistic densities of joint vectors of a source and a target speakers are widely used to estimate a transform function between both the speakers. However, to achieve sufficient quality, these approaches require a parallel corpus which contains plenty of utterances with the same linguistic content spoken by both the speakers. In addition, the joint density GMM methods often suffer from overtraining effects when the amount of training data is small. To compensate for these problems, we propose a voice conversion framework, which integrates the speaker GMM of the target with the joint density model using a noisy channel model. The proposed method trains the joint density model with a few parallel utterances, and the speaker model with nonparallel data of the target, independently. It can ease the burden on the source speaker. Experiments demonstrate the effectiveness of the proposed method, especially when the amount of the parallel corpus is small.
Keywords :
Gaussian processes; probability; speech synthesis; GMM methods; Gaussian mixture model; noisy channel model; parallel utterances; probabilistic densities; source speaker; speaker model; statistical voice conversion; voice conversion; Channel models; Joints; Noise measurement; Pragmatics; Speech; Speech enhancement; Vectors; Joint density model; noisy channel model; probabilistic integration; speaker model; voice conversion (VC);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2012.2188628
Filename :
6156420
Link To Document :
بازگشت