Title :
Voice morphing by 3-D waveform interpolation surface and lossless tube area function
Author :
Porat, G. ; Lavner, Y.
Author_Institution :
Fac. of Electr. Eng., Technion-Israel Inst. of Technol., Haifa, Israel
Abstract :
Voice morphing is the process of gradually transforming the voice of a given speaker to that of another. The ability to change the speaker´s individual characteristics and produce high-quality voices can be used in many applications. For example, in multimedia and video entertainment, voice morphing is just like its visual counterpart: while seeing a face gradually changing from one person´s to another´s, we can simultaneously hear the voice changing as well. Another application could be in forensic voice identification: creating a voice-bank of different pitches, rates, and timbres, to assist in recognition of the suspect´s voice. In this study we present a new technique, which enables the production of N intermediate voices that gradually change between voices of two speakers, or one voice signal that changes gradually. This technique is based on two components. One is creating a 3D prototype waveform interpolation (PWI) surface from the residual error ´ signal, which is estimated from LPC analysis, to produce a new intermediate excitation signal. The second component is a representation of the vocal tract by a lossless tube area function, and interpolation of the two speakers´ parameters.
Keywords :
interpolation; linear predictive coding; speaker recognition; speech coding; speech synthesis; 3D prototype waveform interpolation surfaces; LPC analysis; PWI surfaces; forensic voice identification; high-quality voices; intermediate excitation signals; intermediate voices; lossless tube area function vocal tract representation; multimedia; pitch/rate/timbre voice-bank; residual error signals; speaker individual characteristics; speaker parameter interpolation; speaker voice transformation; speech analysis/synthesis; speech coding methods; video entertainment; voice changing; voice morphing; voice recognition; Filters; Interpolation; Linear predictive coding; Prototypes; Signal analysis; Signal synthesis; Speech analysis; Speech coding; Surface reconstruction; Surface waves;
Conference_Titel :
Electrical and Electronics Engineers in Israel, 2002. The 22nd Convention of
Print_ISBN :
0-7803-7693-5
DOI :
10.1109/EEEI.2002.1178443