DocumentCode :
134352
Title :
Correlation-based frequency warping for voice conversion
Author :
Xiaohai Tian ; Zhizheng Wu ; Lee, S.W. ; Eng Siong Chng
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear :
2014
fDate :
12-14 Sept. 2014
Firstpage :
211
Lastpage :
215
Abstract :
Frequency warping (FW) based voice conversion aims to modify the frequency axis of the source spectra towards that of the target. In previous work, the optimal warping function was calculated by minimizing the spectral distance between the converted and target spectra, without considering spectral shape. However, speaker timbre and identity depend greatly on the vocal tract peaks and valleys of the spectrum. In this paper, we propose a method that defines the warping function by maximizing the correlation between the converted and target spectra. Unlike conventional warping methods, the correlation-based optimization is not determined by the magnitude of the spectra. Instead, both spectral peaks and valleys are considered in the optimization process, which also improves the performance of amplitude scaling. Experiments were conducted on the VOICES database, and the results show that, after amplitude scaling, the proposed method reduced the mel-spectral distortion from 5.85 dB to 5.60 dB. Subjective listening tests also confirmed the effectiveness of the proposed method.
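The idea of choosing a warping function by maximizing correlation rather than minimizing distance can be illustrated with a minimal sketch. This record does not give the paper's warping-function parameterization or optimizer, so everything below is an assumption made for illustration: a single-parameter linear warp (`alpha`), a plain grid search, Pearson correlation as the objective, and synthetic two-peak spectra. The function names `warp_spectrum`, `correlation`, and `best_alpha_by_correlation` are hypothetical.

```python
import numpy as np


def warp_spectrum(spectrum, alpha):
    """Linearly warp the frequency axis (assumed warp, not the paper's)."""
    n = len(spectrum)
    bins = np.arange(n)
    # Output bin k reads the source at position k / alpha, so alpha > 1
    # shifts spectral peaks towards higher bins.
    src_pos = np.clip(bins / alpha, 0, n - 1)
    return np.interp(src_pos, bins, spectrum)


def correlation(a, b):
    """Pearson correlation; unchanged by positive scaling or offset of a or b."""
    return float(np.corrcoef(a, b)[0, 1])


def best_alpha_by_correlation(source, target, alphas):
    """Grid search for the warp factor that maximizes correlation with the target."""
    scores = [correlation(warp_spectrum(source, a), target) for a in alphas]
    return float(alphas[int(np.argmax(scores))])


# Synthetic source spectrum with two formant-like peaks (illustrative data only).
f = np.arange(256)
source = np.exp(-((f - 60) / 8.0) ** 2) + 0.6 * np.exp(-((f - 150) / 12.0) ** 2)

# Target: the same shape warped by 1.2, then rescaled and offset in amplitude.
target = 2.0 * warp_spectrum(source, 1.2) + 5.0

alphas = np.linspace(0.8, 1.4, 61)
best = best_alpha_by_correlation(source, target, alphas)
print(best)
```

Because Pearson correlation ignores positive scaling and offset, the search in this toy setup depends only on where the peaks and valleys line up, which mirrors the abstract's point that the optimization is not determined by spectral magnitude. The amplitude mismatch is left for a separate amplitude-scaling step.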
Keywords :
speech processing; FW based voice conversion; VOICES database; amplitude scaling; correlation-based frequency warping; correlation-based optimization; source spectra; speaker timbre; spectral distance; spectral shape; spectrum valley; subjective listening tests; vocal tract peaks; warping function; Accuracy; Correlation; Frequency conversion; Spectral shape; Speech; Speech processing; Correlation; Frequency warping; Speech synthesis; Voice conversion;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/ISCSLP.2014.6936725
Filename :
6936725