Title :
Voice conversion through transformation of spectral and intonation features
Author :
Rentzos, Dimitrios ; Vaseghi, Saeed ; Yan, Qin ; Ho, Ching-Hsiang
Author_Institution :
Dept. of Electron. & Comput. Eng., Brunel Univ., Uxbridge, UK
Abstract :
This paper presents a voice conversion method based on transforming the characteristic features of a source speaker towards those of a target speaker. Voice characteristic features are grouped into two main categories: (a) the spectral features at formants and (b) the pitch and intonation patterns. Signal modelling and transformation methods for each group of voice features are outlined. The spectral features at formants are modelled using a set of two-dimensional phoneme-dependent HMMs. Subband frequency warping is used for spectrum transformation, with the subbands centred on the estimates of the formant trajectories. The F0 contour is used for modelling the pitch and intonation patterns of speech. A PSOLA-based method is employed for transformation of pitch, intonation patterns and speaking rate. The experiments present illustrations and perceptual evaluations of the results of transforming the various voice features.
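The frequency-warping idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a single magnitude-spectrum frame, formant frequencies that are already known, and a plain piecewise-linear warp anchored at 0 Hz, the formants and the Nyquist frequency, rather than the subband scheme the authors describe. The names interp and warp_spectrum are hypothetical.

```python
from bisect import bisect_right


def interp(x, xs, ys):
    """Linearly interpolate y at x from increasing knots xs; clamp outside the range."""
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    i = bisect_right(xs, x) - 1
    t = (x - xs[i]) / (xs[i + 1] - xs[i])
    return ys[i] + t * (ys[i + 1] - ys[i])


def warp_spectrum(magnitude, sample_rate, source_formants, target_formants):
    """Move spectral content at the source formants to the target formants.

    Assumption: magnitude holds at least two bins spaced evenly from 0 Hz
    to the Nyquist frequency, and both formant lists are increasing and
    of equal length.
    """
    nyquist = sample_rate / 2.0
    n = len(magnitude)
    freqs = [k * nyquist / (n - 1) for k in range(n)]
    # Anchor points of the warp: 0 Hz and Nyquist stay fixed.
    src = [0.0] + list(source_formants) + [nyquist]
    tgt = [0.0] + list(target_formants) + [nyquist]
    # For each output frequency, find the source frequency that maps onto it,
    # then read the source spectrum at that frequency.
    return [interp(interp(f, tgt, src), freqs, magnitude) for f in freqs]
```

With 401 bins at an 8 kHz sampling rate (10 Hz per bin), warping a source formant at 500 Hz to a target of 600 Hz moves a spectral peak from bin 50 to bin 60. A full system would apply such a warp frame by frame along the estimated formant trajectories.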
Keywords :
feature extraction; frequency estimation; hidden Markov models; spectral analysis; speech processing; F0 contour; PSOLA based method; characteristic features transformation; formant trajectory estimates; intonation features; phoneme-dependent HMM; pitch patterns; signal modelling; speaking rate; spectral features; spectrum transformation; subband frequency warping; two-dimensional HMM; voice characteristic features; voice conversion; Bandwidth; Cepstrum; Decoding; Feature extraction; Frequency estimation; Hidden Markov models; Pattern analysis; Spatial databases; Speech synthesis; Viterbi algorithm;
Conference_Title :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325912