Title :
Text-Independent Voice Conversion Based on Unit Selection
Author :
Sündermann, David ; Höge, Harald ; Bonafonte, Antonio ; Ney, Hermann ; Black, Alan ; Narayanan, Shri
Author_Institution :
Siemens Corp. Technol., Munich
Abstract :
So far, most of the voice conversion training procedures are text-dependent, i.e., they are based on parallel training utterances of source and large speaker. Since several applications (e.g. speech-to-speech translation or dubbing) require text-independent training, over the last two years, training techniques that use non-parallel data were proposed In this paper, we present a new approach that applies unit selection to find corresponding time frames in source and target speech. By means of a subjective experiment it is shown that this technique achieves the same performance as the conventional text-dependent training
Keywords :
speech processing; nonparallel data; target speech; text-independent training; text-independent voice conversion; Feature extraction; Frequency domain analysis; Mel frequency cepstral coefficient; Natural languages; Parameter estimation; Predictive models; Speech analysis; Speech processing; Speech synthesis; Testing;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1659962