• DocumentCode
    3228990
  • Title

    A speech recognizer using radial basis function neural networks in an HMM framework

  • Author

    Singer, Elliot ; Lippman, R.P.

  • Author_Institution
    MIT Lincoln Lab., Lexington, MA, USA
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    629
  • Abstract
    A high performance speaker-independent isolated-word speech recognizer was developed which combines hidden Markov models (HMMs) and radial basis function (RBF) neural networks. RBF networks in this recognizer use discriminant training techniques to estimate Bayesian probabilities for each speech frame while HMM decoders estimate overall word likelihood scores for network outputs. RBF training is performed after the HMM recognizer has automatically segmented training tokens using forced Viterbi alignment. In recognition experiments using a speaker-independent E-set database, the hybrid recognizer had an error rate of 11.5% compared to 15.7% for the robust unimodal Gaussian HMM recognizer upon which the hybrid system was based. The error rate was also lower than that of a tied-mixture HMM recognizer with the same number of centers. These results demonstrate that RBF networks can be successfully incorporated in hybrid recognizers and suggest that they may be capable of good performance with fewer parameters than required by Gaussian mixture classifiers
  • Keywords
    error statistics; hidden Markov models; neural nets; speech recognition equipment; Bayesian probabilities; HMM framework; RBF networks; discriminant training techniques; error rate; forced Viterbi alignment; hidden Markov models; performance; radial basis function neural networks; speaker-independent E-set database; speaker-independent isolated-word speech recognizer; speech frame; unimodal Gaussian recognition; Automatic speech recognition; Bayesian methods; Databases; Decoding; Error analysis; Hidden Markov models; Neural networks; Radial basis function networks; Speech recognition; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225830
  • Filename
    225830