Wavelet transform to hybrid support vector machine and hidden Markov model for speech recognition

Author

Shao, Yu ; Chang, Chip-Hong

Author_Institution

Center for Integrated Circuits & Syst., Nanyang Technol. Univ., Singapore

fYear

2005

fDate

23-26 May 2005

Firstpage

3833

Abstract

Recent work in machine learning has focused on models, such as the support vector machine (SVM), that automatically control generalization and parameterization as part of the overall optimization process. In this paper we construct a wavelet transform to hybrid support vector machine and hidden Markov models (WSVM/HMM) speech recognition system to deal with arbitrary nonlinear functions. The proposed hybrid system has overcome the problem of acoustic modeling in state-of-the-art speech recognition systems that usually relies on HMM with Gaussian emission densities. HMM suffer from intrinsic limitations, mainly due to their arbitrary parametric assumption. Artificial neural networks appear to be a promising alternative, but they have historically failed as a general solution to the acoustic modeling problem. The proposed system has successfully exploited the benefits of the wavelet technique, HMM and SVM in a unified framework. In this hybrid system, the WSVM is trained to estimate the emission probabilities of the states of the HMM. Simulations on benchmark databases show that speech recognition systems built around the hybrid WSVM/HMM provide excellent word recognition ratio and have performance superior to many other systems.

Keywords

generalisation (artificial intelligence); hidden Markov models; learning (artificial intelligence); nonlinear functions; speech recognition; support vector machines; wavelet transforms; SVM; WSVM/HMM; acoustic modeling; arbitrary nonlinear functions; emission probabilities; generalization; hidden Markov models; hybrid support vector machine; machine learning; parameterization; performance; speech recognition; speech recognition system; training; wavelet transform; word recognition ratio; Acoustic emission; Artificial neural networks; Automatic control; Databases; Hidden Markov models; Machine learning; Speech recognition; State estimation; Support vector machines; Wavelet transforms;

fLanguage

English

Publisher

ieee

Conference_Titel

Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on

Print_ISBN

0-7803-8834-8

Type

conf

DOI

10.1109/ISCAS.2005.1465466

Filename

1465466