• DocumentCode
    294537
  • Title

    Robust speech recognition based on stochastic matching

  • Author

    Sankar, Ananth ; Lee, Chin-Hui

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    121
  • Abstract
    We present a maximum likelihood (ML) stochastic matching approach to decrease the acoustic mismatch between a test utterance Y and a given set of speech hidden Markov models ΛX so as to reduce the recognition performance degradation caused by possible distortions in the test utterance. This mismatch may be reduced in two ways: (1) by an inverse distortion function Fν(.) that maps Y into an utterance X which matches better with the models ΛX, and (2) by a model transformation function Gη(.) that maps ΛX to the transformed model ΛY which matches better with the utterance Y. The functional form of the transformations depends upon our prior knowledge about the mismatch, and the parameters are estimated along with the recognized string in a maximum likelihood manner using the EM algorithm. Experimental results verify the efficacy of the approach in improving the performance of a continuous speech recognition system in the presence of mismatch due to different transducers and transmission channels
  • Keywords
    acoustic signal processing; hidden Markov models; inverse problems; maximum likelihood estimation; speech recognition; stochastic processes; telecommunication channels; EM algorithm; HMM; acoustic mismatch; continuous speech recognition system; experimental results; hidden Markov models; inverse distortion function; maximum likelihood stochastic matching; model transformation function; parameter estimation; recognition performance; robust speech recognition; speech models; stochastic matching; test utterance distortions; transducers; transformed model; transmission channels; Acoustic distortion; Acoustic testing; Degradation; Hidden Markov models; Maximum likelihood estimation; Predistortion; Robustness; Speech recognition; Stochastic processes; Time of arrival estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479288
  • Filename
    479288