DocumentCode
134352
Title
Correlation-based frequency warping for voice conversion
Author
Xiaohai Tian ; Zhizheng Wu ; Lee, S.W. ; Eng Siong Chng
Author_Institution
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
211
Lastpage
215
Abstract
Frequency warping (FW) based voice conversion aims to modify the frequency axis of the source spectra towards that of the target. In previous work, the optimal warping function was calculated by minimizing the spectral distance between the converted and target spectra, without considering the spectral shape. However, speaker timbre and identity depend greatly on the vocal tract peaks and valleys of the spectrum. In this paper, we propose a method that defines the warping function by maximizing the correlation between the converted and target spectra. Unlike conventional warping methods, the correlation-based optimization is not determined by the magnitude of the spectra. Instead, both spectral peaks and valleys are considered in the optimization process, which also improves the performance of amplitude scaling. Experiments were conducted on the VOICES database, and the results show that, after amplitude scaling, the proposed method reduced the mel-spectral distortion from 5.85 dB to 5.60 dB. Subjective listening tests also confirmed the effectiveness of the proposed method.
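The core idea of the abstract, choosing the warping function by correlation rather than by spectral distance, can be illustrated with a small sketch. Everything in it is an assumption made for illustration and not the paper's formulation: the warping family is a single linear factor `alpha`, the correlation is the Pearson coefficient over frequency bins, the spectra are synthetic, and the names `warp_spectrum`, `correlation`, and `best_alpha_by_correlation` are hypothetical.

```python
import numpy as np

def warp_spectrum(spec, alpha):
    """Resample a spectrum along a linearly warped frequency axis (assumed warping family)."""
    n = len(spec)
    bins = np.arange(n)
    # Converted bin k reads the source at bin k / alpha, clipped to the valid range.
    src_pos = np.clip(bins / alpha, 0, n - 1)
    return np.interp(src_pos, bins, spec)

def correlation(x, y):
    """Pearson correlation over frequency bins (assumed correlation measure)."""
    x = x - x.mean()
    y = y - y.mean()
    return float(x @ y / (np.linalg.norm(x) * np.linalg.norm(y)))

def best_alpha_by_correlation(source, target, alphas):
    """Pick the warping factor whose warped source correlates best with the target."""
    scores = [correlation(warp_spectrum(source, a), target) for a in alphas]
    return float(alphas[int(np.argmax(scores))])

# Synthetic source spectrum with two formant-like peaks (illustrative data only).
bins = np.arange(128)
source = np.exp(-0.5 * ((bins - 30) / 5.0) ** 2) + 0.7 * np.exp(-0.5 * ((bins - 80) / 8.0) ** 2)

# Candidate warping factors; the target is the warped source with a different gain and offset.
alphas = np.linspace(0.8, 1.4, 13)
true_alpha = float(alphas[8])
target = 0.5 * warp_spectrum(source, true_alpha) + 10.0

found = best_alpha_by_correlation(source, target, alphas)
```

Because the Pearson coefficient removes the mean and normalizes the scale, the gain and offset applied to `target` do not affect the search, so `found` equals `true_alpha`. The level mismatch that remains would be left to a separate amplitude scaling step, which this sketch does not model.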
Keywords
speech processing; FW based voice conversion; VOICES database; amplitude scaling; correlation-based frequency warping; correlation-based optimization; source spectra; speaker timbre; spectral distance; spectral shape; spectrum valley; subjective listening tests; vocal tract peaks; warping function; Accuracy; Correlation; Frequency conversion; Spectral shape; Speech; Speech processing; Frequency warping; Speech synthesis; Voice conversion
fLanguage
English
Publisher
IEEE
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936725
Filename
6936725