DocumentCode :
1059972
Title :
Switching Auxiliary Chains for Speech Recognition
Author :
Lin, Hui ; Ou, Zhijian
Author_Institution :
Tsinghua Univ., Beijing
Volume :
14
Issue :
8
fYear :
2007
Firstpage :
568
Lastpage :
571
Abstract :
This letter investigates the problem of incorporating auxiliary information, e.g., pitch, zero crossing rate (ZCR), and rate-of-speech (ROS), for speech recognition using dynamic Bayesian networks. In this letter, we propose switching auxiliary chains for exploiting different auxiliary information tailored to different phonetic states. The switching function can be specified by a priori knowledge or, more flexibly, be learned from data with information-theoretic dependency selection. Experiments on the OGI Numbers database show that the new model achieves 7% word-error-rate relative reduction by jointly exploiting pitch, ZCR, and ROS, while keeping almost the same parameter size as the standard HMM.
Keywords :
Bayes methods; speech recognition; auxiliary information; dynamic Bayesian networks; phonetic states; speech recognition; switching auxiliary chains; word-error-rate relative reduction; Automatic speech recognition; Bayesian methods; Degradation; Hidden Markov models; Quantization; Random variables; Robustness; Spatial databases; Speech recognition; Auxiliary features; dynamic Bayesian networks (DBNs); speech recognition;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2006.891314
Filename :
4276738
Link To Document :
بازگشت