DocumentCode
2702464
Title
Towards a Voice Conversion System Based on Frame Selection
Author
Dutoit, Thierry ; Holzapfel, A. ; Jottrand, M. ; Moinet, A. ; Perez, J.M. ; Stylianou, Yannis
Author_Institution
Faculte Polytech de Mons, Belgium
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
This paper addresses the conversion of a given speaker's voice (the source speaker) into another identified voice (the target speaker). We assume that a large amount of speech from both the source and target voices is available, at least part of it parallel. The proposed system is built on a mapping function between source and target spectral envelopes, followed by a frame selection algorithm that produces the final spectral envelopes. Converted speech is produced by a basic LP analysis of the source, followed by LP synthesis using the converted spectral envelopes. We compared three types of conversion: without mapping; with mapping, using the excitation of the source speaker; and with mapping, using the excitation of the target speaker. Results show that the combination of mapping and frame selection provides the best results, and underline the value of working on methods to convert the LP excitation.
Keywords
linear predictive coding; speech coding; speech synthesis; LP analysis; LP synthesis; frame selection algorithm; spectral envelopes; voice conversion system; Context modeling; Databases; Frequency; Interpolation; Land mobile radio; Loudspeakers; Speech analysis; Speech synthesis; Training data; Vector quantization; Voice conversion; frame selection; voice mapping
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.366962
Filename
4218150