DocumentCode
399584
Title
Nonlinear speech model based on support vector machine and wavelet transform
Author
Li, Jianmin ; Zhang, Bo ; Lin, Fuzong
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear
2003
fDate
3-5 Nov. 2003
Firstpage
259
Lastpage
263
Abstract
To improve the naturalness of reconstructed speech, nonlinear speech models are paid more and more attention in recent years. A nonlinear speech model for speech synthesis based on support vector machine (SVM) is presented firstly. After speech signal is embedded into phase space, nonlinear map in the model is obtained with support vector regression. It is shown in the experiments that for some pieces of speech, not only can speech be perfectly reconstructed by the system, but also jitter and shimmer in the original signal is preserved. However, the output of the system is quite different from the original one for other pieces. The reason is that the sub-bands with different frequency in the original signal can not be perfectly described by a SVM-based autoregressive model trained with one set of training parameters. Consequently, a multi-band model is then proposed. After the original speech is decomposed into several bands through wavelet packet decomposition, a nonlinear dynamical model based on SVM is constructed for each sub-band signal. It is shown in the experiments that the stability of such system is improved.
Keywords
speech processing; speech synthesis; support vector machines; wavelet transforms; SVM; autoregressive model; multiband model; nonlinear dynamical model; nonlinear speech model; phase space; speech reconstruction; speech signal; speech synthesis; subband signal; support vector machine; support vector regression; training parameter; wavelet packet decomposition; wavelet transform; Aerodynamics; Frequency; Jitter; Nonlinear filters; Production systems; Speech coding; Speech enhancement; Speech synthesis; Support vector machines; Wavelet transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 2003. Proceedings. 15th IEEE International Conference on
ISSN
1082-3409
Print_ISBN
0-7695-2038-3
Type
conf
DOI
10.1109/TAI.2003.1250199
Filename
1250199
Link To Document