• DocumentCode
    417302
  • Title

    Asynchronous HMM with applications to speech recognition

  • Author

    Garg, Ashutosh ; Balakrishnan, Sreeram ; Vaithyanathan, Shivakumar

  • Author_Institution
    Almaden Res. Center, San Jose, CA, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We develop a novel formalism for modeling speech signals which are irregularly or incompletely sampled. This situation can arise in real world applications where the speech signal is being transmitted over an error prone channel where parts of the signal can be dropped. Typical speech systems based on hidden Markov models, cannot handle such data since HMMs rely on the assumption that observations are complete and made at regular intervals. We introduce the asynchronous HMM, a variant of the inhomogeneous HMM commonly used in bioinformatics, and show how it can be used to model irregularly or incompletely sampled data. A nested EM algorithm is presented in brief which can be used to learn the parameters of this asynchronous HMM. Evaluation on real world speech data, which has been modified to simulate channel errors, shows that this model and its variants significantly outperform the standard HMM and methods based on data interpolation.
  • Keywords
    hidden Markov models; learning (artificial intelligence); optimisation; signal sampling; speech recognition; asynchronous HMM; data interpolation; error prone channel; hidden Markov models; incomplete sampling; inhomogeneous HMM; irregular sampling; nested EM algorithm; speech recognition; speech signal modeling; Bioinformatics; Computational modeling; Computer errors; Computer vision; Hidden Markov models; Interpolation; Random variables; Speech analysis; Speech recognition; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326159
  • Filename
    1326159