Title :
Voive conversion based on a statistical model
Author :
Bi, QingGang ; Zhang, Linghua
Author_Institution :
Coll. of Telecommun. & Inf. Eng., Nanjing Univ. of Post & Telecommun., Nanjing, China
Abstract :
In this paper, we propose to use a voice conversion method based on transformation of the characteristic features of a source speaker towards a target. The main objective of the work involves building a nonlinear relationship between parameters for the acoustical features of two speakers, based on a probabilistic model. The conversion rules involve the probabilistic classification and a cross correlation probability between the acoustic features of the two speakers. The parameters of the conversion rules are estimated by estimating the maximum likelihood of the training data. A comparative study of voice conversion with the proposed method and conventional vector quantization (VQ) is conducted. The experimental results of voice conversion evaluated using subjective and objective measures indicated that the performance can be dramatically improved by the proposed method.
Keywords :
maximum likelihood estimation; probability; speech processing; vector quantisation; VQ; acoustical features; conventional vector quantization; cross-correlation probability; maximum likelihood estimation; objective measures; probabilistic classification; probabilistic model; source speaker characteristic features; statistical model; subjective measures; voice conversion method; Hidden Markov models; Yttrium; Maximum likelihood (ML) estimation; VQ; voice conversion;
Conference_Titel :
Communication Technology (ICCT), 2010 12th IEEE International Conference on
Conference_Location :
Nanjing
Print_ISBN :
978-1-4244-6868-3
DOI :
10.1109/ICCT.2010.5689014