• DocumentCode
    3546858
  • Title

    Wavelet transform to hybrid support vector machine and hidden Markov model for speech recognition

  • Author

    Shao, Yu ; Chang, Chip-Hong

  • Author_Institution
    Center for Integrated Circuits & Syst., Nanyang Technol. Univ., Singapore
  • fYear
    2005
  • fDate
    23-26 May 2005
  • Firstpage
    3833
  • Abstract
    Recent work in machine learning has focused on models, such as the support vector machine (SVM), that automatically control generalization and parameterization as part of the overall optimization process. In this paper we construct a wavelet transform to hybrid support vector machine and hidden Markov models (WSVM/HMM) speech recognition system to deal with arbitrary nonlinear functions. The proposed hybrid system has overcome the problem of acoustic modeling in state-of-the-art speech recognition systems that usually relies on HMM with Gaussian emission densities. HMM suffer from intrinsic limitations, mainly due to their arbitrary parametric assumption. Artificial neural networks appear to be a promising alternative, but they have historically failed as a general solution to the acoustic modeling problem. The proposed system has successfully exploited the benefits of the wavelet technique, HMM and SVM in a unified framework. In this hybrid system, the WSVM is trained to estimate the emission probabilities of the states of the HMM. Simulations on benchmark databases show that speech recognition systems built around the hybrid WSVM/HMM provide excellent word recognition ratio and have performance superior to many other systems.
  • Keywords
    generalisation (artificial intelligence); hidden Markov models; learning (artificial intelligence); nonlinear functions; speech recognition; support vector machines; wavelet transforms; SVM; WSVM/HMM; acoustic modeling; arbitrary nonlinear functions; emission probabilities; generalization; hidden Markov models; hybrid support vector machine; machine learning; parameterization; performance; speech recognition; speech recognition system; training; wavelet transform; word recognition ratio; Acoustic emission; Artificial neural networks; Automatic control; Databases; Hidden Markov models; Machine learning; Speech recognition; State estimation; Support vector machines; Wavelet transforms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
  • Print_ISBN
    0-7803-8834-8
  • Type

    conf

  • DOI
    10.1109/ISCAS.2005.1465466
  • Filename
    1465466