• DocumentCode
    2965255
  • Title

    REMAP-experiments with speech recognition

  • Author

    Konig, Yochai ; Bourlard, Herve ; Morgan, Nelson

  • Author_Institution
    Int. Comput. Sci. Inst., Berkeley, CA, USA
  • Volume
    6
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    3350
  • Abstract
    We present experimental and theoretical results using a framework for training and modeling continuous speech recognition systems based on the theoretically optimal maximum a posteriori (MAP) criterion. This is in contrast to most state-of-the-art systems which are trained according to a maximum likelihood (ML) criterion. Although the algorithm is quite general, we applied it to a particular form of hybrid system combining hidden Markov models (HMMs) and artificial neural networks (ANNs) in which the ANN targets and weights are iteratively reestimated to guarantee the increase of the posterior probability of the correct model, hence actually minimizing the error rate. More specifically, this training approach is applied to a transition-based model that uses local conditional transition probabilities (i.e. the posterior probability of the current state given the current acoustic vector and the previous state) to estimate the posterior probabilities of sentences. Experimental results on isolated and continuous speech recognition tasks show an increase in the estimates of posterior probabilities of the correct sentences after training, and significant decreases in error rates in comparison to a baseline system
  • Keywords
    hidden Markov models; neural nets; pattern classification; probability; speech recognition; REMAP; artificial neural networks; continuous speech recognition; hidden Markov models; isolated speech recognition tasks; local conditional transition probabilities; maximum a posteriori criterion; posterior probability; transition-based model; Computer science; Error analysis; Error correction; Hidden Markov models; Iterative algorithms; Maximum likelihood estimation; Pattern classification; Probability; Speech recognition; State estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.550595
  • Filename
    550595