DocumentCode :
2174503
Title :
High accurate model-integration-based voice conversion using dynamic features and model structure optimization
Author :
Saito, Daisuke ; Watanabe, Shinji ; Nakamura, Atsushi ; Minematsu, Nobuaki
Author_Institution :
Univ. of Tokyo, Tokyo, Japan
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4576
Lastpage :
4579
Abstract :
This paper combines a parameter generation algorithm and a model optimization approach with the model-integration-based voice con version (MIVC). We have proposed probabilistic integration of a joint density model and a speaker model to mitigate a requirement of the parallel corpus in voice conversion (VC) based on Gaussian Mixture Model (GMM). As well as the other VC methods, MIVC also suffers from the problems; the degradation of the perceptual quality caused by the discontinuity through the parameter trajectory, and the difficulty to optimize the model structure. To solve the problems, this paper proposes a parameter generation algorithm constrained by dynamic features for the first problem and an information criterion including mutual influences between the joint density model and the speaker model for the second problem. Experimental results show that the first approach improved the performance of VC and the second approach appropriately predicted the optimal number of mixtures of the speaker model for our MIVC.
Keywords :
Gaussian distribution; integration; speaker recognition; GMM; Gaussian mixture model; MIVC; VC methods; dynamic feature; high accurate model-integration-based voice conversion; joint density model; model structure optimization; parameter generation algorithm; perceptual quality; speaker model; Adaptation models; Data models; Equations; Joints; Mathematical model; Optimization; Speech; Voice conversion; dynamic features; information criterion; probabilistic integration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947373
Filename :
5947373
Link To Document :
بازگشت