Title :
A new voice transformation method based on both linear and nonlinear prediction analysis
Author :
Lee, Ki Seung ; Youn, Dae Hee ; Cha, Il Whan
Author_Institution :
Dept. of Electron. Eng., Yonsei Univ., Seoul, South Korea
Abstract :
We describe a voice transformation method which changes the source speaker´s acoustic features to those of a target speaker. In the method acoustic features are divided into two parts, linear and nonlinear parts. Linear parts are characterized by LPC cepstrum coefficients which are obtained from LP analysis. The nonlinear part, which represents the excitation signal, is modelled by the long-delay nonlinear predictor using a neural net. Conversion rules for the excitation signal are generated by the average pitch ratio and the mapping codebook, and those for LPC cepstrum coefficients are based on the orthogonal vector space conversion. In addition, the spectral envelope compensation is proposed to correct spectral distortion. In the transformed speech a listening test shows that the proposed method makes it possible to convert speaker´s individuality while maintaining high quality
Keywords :
cepstral analysis; feature extraction; neural nets; speech recognition; speech synthesis; vectors; LP analysis; LPC cepstrum coefficients; acoustic features; average pitch ratio; conversion rules; excitation signal; linear prediction analysis; long-delay nonlinear predictor; mapping codebook; neural net; nonlinear prediction analysis; orthogonal vector space conversion; quality; source speaker; spectral distortion; spectral envelope compensation; voice transformation method; Acoustic distortion; Cepstral analysis; Cepstrum; Linear predictive coding; Loudspeakers; Neural networks; Nonlinear acoustics; Predictive models; Signal generators; Signal mapping;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607876