مرکز منطقه ای اطلاع رساني علوم و فناوري - Non-parallel training for voice conversion based on FT-GMM

DocumentCode :

2177806

Title :

Non-parallel training for voice conversion based on FT-GMM

Author :

Chen, Ling-Hui ; Ling, Zhen-Hua ; Dai, Li-Rong

Author_Institution :

iFLYTEK Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

5116

Lastpage :

5119

Abstract :

This paper presents a non-parallel training algorithm for voice con version based on feature transform Gaussian mixture model (FT GMM), which is a mixture model of joint density space of source speaker and target speaker with explicit feature transform modeling. In FT-GMM, the correlations between the distributions of two speakers in each component of the mixture model are not directly modeled, but absorbed into these explicit feature transformations. This makes it possible to extend this model to non-parallel training by simply decomposing it into two sub-models, one for each speaker and optimizing them separatively. A frequency warping process is adopted to compensate performance degradation caused by original spectral distance between source and target speakers. Cross-gender experimental results show that the proposed method achieves comparable performance as parallel training.

Keywords :

Gaussian processes; speech synthesis; FT-GMM; feature transform Gaussian mixture model; feature transform modeling; frequency warping process; nonparallel training algorithm; source speaker; source speakers; target speaker; target speakers; voice conversion; Cepstral analysis; Hidden Markov models; Joints; Speech; Training; Training data; Gaussian mixture model; Voice conversion; frequency warping; non-parallel training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947508

Filename :

5947508

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2177806