Title :
Cost Reduction of Training Mapping Function Based on Multistep Voice Conversion
Author :
Masuda, T. ; Shozakai, M.
Author_Institution :
New Bus. Dev., Asahi Kasei Corp., Kanagawa, Japan
Abstract :
Several statistical approaches to voice conversion from one speaker to another have been developed. In statistical spectral mapping, a typical method among these approaches, a mapping function that represents the correlation between different speakers is determined from spectral features. The problem with this technique is that a mapping function must be trained for each speaker pair, so the training cost becomes a serious issue when the number of speakers increases significantly. This paper describes a novel voice conversion method that reduces the training cost. The technique is easy to implement and can use conventional techniques directly. Experimental results demonstrate that the proposed method nearly maintains the quality of conventional converted speech despite a significant reduction in training cost.
Keywords :
feature extraction; speech synthesis; correlation; multistep voice conversion; spectral features; statistical spectral mapping method; training cost reduction; training mapping function; Cost function; Maximum likelihood estimation; Speech enhancement; Statistical analysis; Vector quantization; Voice conversion; training cost
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367007