DocumentCode
2608417
Title
Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks
Author
Lin, Hui ; Ou, Zhijian
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing
Volume
4
fYear
0
fDate
0-0 0
Firstpage
258
Lastpage
261
Abstract
This paper investigates the problem of incorporating auxiliary information (e.g. pitch) for speech recognition using dynamic Bayesian networks (DBNs). Previous works usually model acoustic features conditional on the pitch auxiliary variable for both voiced and unvoiced phonetic states, and therefore ignore the fact that pitch (frequency) information is meaningful only for voiced states. In this paper we propose a switching two auxiliary chain model tailored to voiced/unvoiced states for exploiting pitch information, which is essentially built on the switching parent functionality of Bayesian multinets. Experiments on the OGI Numbers database show that significant performance improvements are achieved from switching auxiliary chain modeling, compared with regular auxiliary chain modeling and the standard HMM
Keywords
belief networks; speech recognition; Bayesian multinets; dynamic Bayesian networks; pitch; speech recognition; switching auxiliary chain modeling; Acoustical engineering; Automatic speech recognition; Bayesian methods; Frequency; Hidden Markov models; Random variables; Spatial databases; Speech enhancement; Speech recognition; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location
Hong Kong
ISSN
1051-4651
Print_ISBN
0-7695-2521-0
Type
conf
DOI
10.1109/ICPR.2006.1098
Filename
1699829
Link To Document