DocumentCode
3546858
Title
Wavelet transform to hybrid support vector machine and hidden Markov model for speech recognition
Author
Shao, Yu ; Chang, Chip-Hong
Author_Institution
Center for Integrated Circuits & Syst., Nanyang Technol. Univ., Singapore
fYear
2005
fDate
23-26 May 2005
Firstpage
3833
Abstract
Recent work in machine learning has focused on models, such as the support vector machine (SVM), that automatically control generalization and parameterization as part of the overall optimization process. In this paper we construct a wavelet transform to hybrid support vector machine and hidden Markov models (WSVM/HMM) speech recognition system to deal with arbitrary nonlinear functions. The proposed hybrid system has overcome the problem of acoustic modeling in state-of-the-art speech recognition systems that usually relies on HMM with Gaussian emission densities. HMM suffer from intrinsic limitations, mainly due to their arbitrary parametric assumption. Artificial neural networks appear to be a promising alternative, but they have historically failed as a general solution to the acoustic modeling problem. The proposed system has successfully exploited the benefits of the wavelet technique, HMM and SVM in a unified framework. In this hybrid system, the WSVM is trained to estimate the emission probabilities of the states of the HMM. Simulations on benchmark databases show that speech recognition systems built around the hybrid WSVM/HMM provide excellent word recognition ratio and have performance superior to many other systems.
Keywords
generalisation (artificial intelligence); hidden Markov models; learning (artificial intelligence); nonlinear functions; speech recognition; support vector machines; wavelet transforms; SVM; WSVM/HMM; acoustic modeling; arbitrary nonlinear functions; emission probabilities; generalization; hidden Markov models; hybrid support vector machine; machine learning; parameterization; performance; speech recognition; speech recognition system; training; wavelet transform; word recognition ratio; Acoustic emission; Artificial neural networks; Automatic control; Databases; Hidden Markov models; Machine learning; Speech recognition; State estimation; Support vector machines; Wavelet transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
Print_ISBN
0-7803-8834-8
Type
conf
DOI
10.1109/ISCAS.2005.1465466
Filename
1465466
Link To Document