DocumentCode
2093457
Title
Voice conversion based on a statistical model
Author
Bi, QingGang ; Zhang, Linghua
Author_Institution
Coll. of Telecommun. & Inf. Eng., Nanjing Univ. of Post & Telecommun., Nanjing, China
fYear
2010
fDate
11-14 Nov. 2010
Firstpage
1414
Lastpage
1417
Abstract
In this paper, we propose a voice conversion method that transforms the characteristic features of a source speaker towards those of a target speaker. The main objective of the work is to build a nonlinear relationship between the acoustic feature parameters of the two speakers, based on a probabilistic model. The conversion rules involve a probabilistic classification and a cross-correlation probability between the acoustic features of the two speakers. The parameters of the conversion rules are obtained by maximum likelihood estimation on the training data. A comparative study of voice conversion with the proposed method and conventional vector quantization (VQ) is conducted. The experimental results, evaluated using subjective and objective measures, indicate that the proposed method can dramatically improve performance.
Keywords
maximum likelihood estimation; probability; speech processing; vector quantisation; VQ; acoustical features; conventional vector quantization; cross-correlation probability; objective measures; probabilistic classification; probabilistic model; source speaker characteristic features; statistical model; subjective measures; voice conversion method; Hidden Markov models; Yttrium; Maximum likelihood (ML) estimation; voice conversion
fLanguage
English
Publisher
ieee
Conference_Titel
Communication Technology (ICCT), 2010 12th IEEE International Conference on
Conference_Location
Nanjing
Print_ISBN
978-1-4244-6868-3
Type
conf
DOI
10.1109/ICCT.2010.5689014
Filename
5689014