• DocumentCode
    1059972
  • Title

    Switching Auxiliary Chains for Speech Recognition

  • Author

    Lin, Hui ; Ou, Zhijian

  • Author_Institution
    Tsinghua Univ., Beijing
  • Volume
    14
  • Issue
    8
  • fYear
    2007
  • Firstpage
    568
  • Lastpage
    571
  • Abstract
    This letter investigates the problem of incorporating auxiliary information, e.g., pitch, zero crossing rate (ZCR), and rate-of-speech (ROS), for speech recognition using dynamic Bayesian networks. In this letter, we propose switching auxiliary chains for exploiting different auxiliary information tailored to different phonetic states. The switching function can be specified by a priori knowledge or, more flexibly, be learned from data with information-theoretic dependency selection. Experiments on the OGI Numbers database show that the new model achieves 7% word-error-rate relative reduction by jointly exploiting pitch, ZCR, and ROS, while keeping almost the same parameter size as the standard HMM.
  • Keywords
    Bayes methods; speech recognition; auxiliary information; dynamic Bayesian networks; phonetic states; speech recognition; switching auxiliary chains; word-error-rate relative reduction; Automatic speech recognition; Bayesian methods; Degradation; Hidden Markov models; Quantization; Random variables; Robustness; Spatial databases; Speech recognition; Auxiliary features; dynamic Bayesian networks (DBNs); speech recognition;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2006.891314
  • Filename
    4276738