DocumentCode :
1653415
Title :
A precise estimation of vocal tract parameters for high quality voice morphing
Author :
Xu, Ning ; Yang, Zhen
Author_Institution :
Inst. of Signal Process. & Transm., Nanjing Univ. of Post & Telecommun., Nanjing
fYear :
2008
Firstpage :
684
Lastpage :
687
Abstract :
One of the most recent models for voice conversion combines the classical LPC analysis-synthesis model with a GMM, aiming to separate excitation information from vocal tract information and to learn the transformation rules with statistical methods. However, it does not work as well as expected, owing to inaccuracy in the extracted features and to the over-smoothed spectra produced by traditional GMM conversion. In this paper, we propose a novel method to solve this problem, based on separating the glottal waveform and predicting the excitation. The final results show that the transformed vocal tract parameters match the target's more closely and that the high quality of the synthesized speech is preserved.
Keywords :
Gaussian processes; Markov processes; speech synthesis; GMM; classical LPC analysis-synthesis model; glottal waveforms; high quality voice morphing; statistical methods; vocal tract parameters; voice conversion; Data mining; Feature extraction; Information analysis; Linear predictive coding; Signal analysis; Signal processing; Signal synthesis; Speech analysis; Statistical analysis
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signal Processing, 2008. ICSP 2008. 9th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-2178-7
Electronic_ISBN :
978-1-4244-2179-4
Type :
conf
DOI :
10.1109/ICOSP.2008.4697223
Filename :
4697223