DocumentCode :
3246392
Title :
VTLN-based cross-language voice conversion
Author :
Sundermann, D. ; Ney, Hermann ; Hoge, H.
Author_Institution :
Comput. Sci. Dept., RWTH Aachen - Univ. of Technol., Germany
fYear :
2003
fDate :
30 Nov.-3 Dec. 2003
Firstpage :
676
Lastpage :
681
Abstract :
In speech recognition, vocal tract length normalization (VTLN) is a well-studied technique for speaker normalization. As cross-language voice conversion aims at the transformation of a source speaker´s voice into that of a target speaker using a different language, we want to investigate whether VTLN is an appropriate method to adapt the voice characteristics. After applying several conventional VTLN warping functions, we extend the conventional piece-wise linear function to several segments, allowing a more detailed warping of the source spectrum. Experiments on cross-language voice conversion are performed on three corpora of two languages and both speaker genders.
Keywords :
language translation; piecewise linear techniques; speech recognition; speech synthesis; VTLN warping functions; VTLN-based cross-language voice conversion; piece-wise linear warping; speaker normalization; speech recognition; vocal tract length normalization; voice characteristics; Frequency; Humans; Natural languages; Parameter estimation; Piecewise linear techniques; Smoothing methods; Speech processing; Speech recognition; Speech synthesis; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
Type :
conf
DOI :
10.1109/ASRU.2003.1318521
Filename :
1318521
Link To Document :
بازگشت