DocumentCode
1935807
Title
A segment-based approach to voice conversion
Author
Abe, Masanobu
Author_Institution
ATR Interpreting Telephony Res. Lab., Kyoto, Japan
fYear
1991
fDate
14-17 Apr 1991
Firstpage
765
Abstract
A voice conversion algorithm that uses speech segments as conversion units is proposed. Input speech is decomposed into speech segments by a speech recognition module, and the segments are replaced by speech segments uttered by another speaker. This algorithm makes it possible to convert not only the static characteristics but also the dynamic characteristics of speaker individuality. The proposed voice conversion algorithm was used with two male speakers. Spectrum distortion between target speech and the converted speech was reduced to one-third the natural spectrum distortion between the two speakers. A listening experiment showed that, in terms of speaker identification accuracy, the speech converted by segment-sized units gave a score 20% higher than the speech converted frame-by-frame
Keywords
speech analysis and processing; speech synthesis; converted speech; dynamic characteristics; listening experiment; male speakers; natural spectrum distortion; speaker identification accuracy; speaker individuality; speech recognition module; speech segments; speech synthesis; static characteristics; target speech; voice conversion algorithm; Cepstral analysis; Databases; Hidden Markov models; Linear predictive coding; Loudspeakers; Speech analysis; Speech coding; Speech recognition; Speech synthesis; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location
Toronto, Ont.
ISSN
1520-6149
Print_ISBN
0-7803-0003-3
Type
conf
DOI
10.1109/ICASSP.1991.150451
Filename
150451
Link To Document