Title :
Voice conversion using Bilinear Model integrated with joint GMM-based classification
Author :
Xinjian Sun ; Xiongwei Zhang ; Jibin Yang ; Tieyong Cao
Author_Institution :
Coll. of Commun. Eng., PLA Univ. of Sci. & Technol., Nanjing, China
Abstract :
Bilinear Model (BM) can express both characteristics within a speaker (style) and phonemes across speakers (content) independently in a speech database. It has a successful application in voice conversion (VC) by extrapolation. However, extrapolation suffers an undesired repetition of BM building and a large-scale estimation of parameters. To tackle these problems, we propose to enhance the normal BM-based VC scheme by integrating a joint Gaussian Mixture Model (GMM)-based classification, assuming that the GMM components correspond to the quasi-phoneme content classes. The enhanced scheme not only optimizes the VC algorithm in computation, but also improves the quality of speech compared to the normal BM-based one, as well as traditional GMM-based mapping system in evaluation experiments.
Keywords :
Gaussian processes; mixture models; parameter estimation; speech processing; Gaussian mixture model-based classification; bilinear model; large-scale parameter estimation; mapping system; quasi-phoneme content classes; speaker; speech database; speech quality; voice conversion; Buildings; Databases; Extrapolation; Joints; Speech; Training; Vectors;
Conference_Titel :
Information Science and Technology (ICIST), 2013 International Conference on
Conference_Location :
Yangzhou
Print_ISBN :
978-1-4673-5137-9
DOI :
10.1109/ICIST.2013.6747758