DocumentCode :
2017158
Title :
Improving GMM-based spectral conversion with optimal conversion function selection
Author :
Hwang, Hsin-Te ; Wu, Wen-Liang ; Chen, Sin-Horng
Author_Institution :
Dept. of Electr. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
fYear :
2010
fDate :
Nov. 29 2010-Dec. 3 2010
Firstpage :
392
Lastpage :
396
Abstract :
We address the problem in the conventional Gaussian mixture model (GMM)-based spectral conversion from the viewpoint of optimal conversion function selection. The proposed method is motivated by that if the optimal conversion function based on minimum mel-cepstral distortion (MMCD) criterion can be selected during the conversion stage, the conversion performance in terms of mel-cepstral distortion (MCD) can be improved dramatically. To this end, our goal is to improve the accuracy rate of the optimal conversion function selection by the MMCD-based data clustering with Linear Discriminant Analysis (LDA). Experiment results confirmed that the proposed method can effectively improve the conventional method.
Keywords :
Gaussian processes; pattern clustering; speech processing; speech synthesis; statistical analysis; GMM-based spectral conversion; Gaussian mixture model; data clustering; linear discriminant analysis; minimum mel-cepstral distortion; optimal conversion function selection; voice conversion; Accuracy; Joints; Linear discriminant analysis; Speech; Training; Transforms; Vectors; Gaussian mixture model (GMM); Voice conversion (VC);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-6244-5
Type :
conf
DOI :
10.1109/ISCSLP.2010.5684860
Filename :
5684860
Link To Document :
بازگشت