DocumentCode :
1723829
Title :
Voice conversion based on empirical conditional distribution in resource-limited scenarios
Author :
Ning Xu ; Yibin Tang ; Jingyi Bao ; Xiao Yao ; Aimin Jiang ; Xiaofeng Liu
Author_Institution :
Coll. of IoT Eng., Hohai Univ., Changzhou, China
fYear :
2015
Firstpage :
172
Lastpage :
173
Abstract :
In this paper, a computationally efficient voice conversion system has been designed in order to improve the performance in resource-limited scenarios. First, mixtures of Gaussians (MoGs) at fixed locations of Mel frequencies have been used to represent the spectrum of STRAIGHT compactly. Second, the key conditional distributions for prediction are approximated by building histograms of aligned features empirically. Experiments have confirmed that our proposed method can obtain fairly good results compared to the traditional method without huge computational costs.
Keywords :
Gaussian processes; speech enhancement; Gaussians mixtures; Mel frequencies; MoGs; STRAIGHT compactly spectrum; empirical conditional distribution; mixtures of Gaussians; voice conversion system; Buildings; Computational efficiency; Computational modeling; Feature extraction; Histograms; Speech; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Consumer Electronics - Taiwan (ICCE-TW), 2015 IEEE International Conference on
Conference_Location :
Taipei
Type :
conf
DOI :
10.1109/ICCE-TW.2015.7216839
Filename :
7216839
Link To Document :
بازگشت