DocumentCode
1059972
Title
Switching Auxiliary Chains for Speech Recognition
Author
Lin, Hui ; Ou, Zhijian
Author_Institution
Tsinghua Univ., Beijing
Volume
14
Issue
8
fYear
2007
Firstpage
568
Lastpage
571
Abstract
This letter investigates the problem of incorporating auxiliary information, e.g., pitch, zero crossing rate (ZCR), and rate-of-speech (ROS), for speech recognition using dynamic Bayesian networks. In this letter, we propose switching auxiliary chains for exploiting different auxiliary information tailored to different phonetic states. The switching function can be specified by a priori knowledge or, more flexibly, be learned from data with information-theoretic dependency selection. Experiments on the OGI Numbers database show that the new model achieves 7% word-error-rate relative reduction by jointly exploiting pitch, ZCR, and ROS, while keeping almost the same parameter size as the standard HMM.
Keywords
Bayes methods; speech recognition; auxiliary information; dynamic Bayesian networks; phonetic states; speech recognition; switching auxiliary chains; word-error-rate relative reduction; Automatic speech recognition; Bayesian methods; Degradation; Hidden Markov models; Quantization; Random variables; Robustness; Spatial databases; Speech recognition; Auxiliary features; dynamic Bayesian networks (DBNs); speech recognition;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2006.891314
Filename
4276738
Link To Document