DocumentCode :
2705769
Title :
One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices
Author :
Toda, Takechi ; Ohtani, Y. ; Shikano, Kiyohiro
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
This paper describes two flexible frameworks of voice conversion (VC), i.e., one-to-many VC and many-to-one VC. One-to-many VC realizes the conversion from a user´s voice as a source to arbitrary target speakers´ ones and many-to-one VC realizes the conversion vice versa. We apply eigenvoice conversion (EVC) to both VC frameworks. Using multiple parallel data sets consisting of utterance-pairs of the user and multiple pre-stored speakers, an eigenvoice Gaussian mixture model (EV-GMM) is trained in advance. Unsupervised adaptation of the EV-GMM is available to construct the conversion model for arbitrary target speakers in one-to-many VC or arbitrary source speakers in many-to-one VC using only a small amount of their speech data. Results of various experimental evaluations demonstrate the effectiveness of the proposed VC frameworks.
Keywords :
Gaussian processes; acoustic signal processing; eigenvalues and eigenfunctions; eigenvoice Gaussian mixture model; eigenvoice conversion; many-to-one voice conversion; multiple parallel data sets; multiple pre-stored speakers; one-to-many voice conversion; Acoustics; Adaptation model; Information science; Interpolation; Loudspeakers; Natural languages; Speech recognition; Speech synthesis; Virtual colonoscopy; Yttrium; Speech synthesis; eigenvoice; many-to-one; one-to-many; voice conversion;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367303
Filename :
4218334
Link To Document :
بازگشت