Switching Auxiliary Chains for Speech Recognition based on Dynamic Bayesian Networks

Author

Lin, Hui ; Ou, Zhijian

Author_Institution

Dept. of Electron. Eng., Tsinghua Univ., Beijing

Volume

4

fYear

0

fDate

0-0 0

Firstpage

258

Lastpage

261

Abstract

This paper investigates the problem of incorporating auxiliary information (e.g. pitch) for speech recognition using dynamic Bayesian networks (DBNs). Previous works usually model acoustic features conditional on the pitch auxiliary variable for both voiced and unvoiced phonetic states, and therefore ignore the fact that pitch (frequency) information is meaningful only for voiced states. In this paper we propose a switching two auxiliary chain model tailored to voiced/unvoiced states for exploiting pitch information, which is essentially built on the switching parent functionality of Bayesian multinets. Experiments on the OGI Numbers database show that significant performance improvements are achieved from switching auxiliary chain modeling, compared with regular auxiliary chain modeling and the standard HMM

Keywords

belief networks; speech recognition; Bayesian multinets; dynamic Bayesian networks; pitch; speech recognition; switching auxiliary chain modeling; Acoustical engineering; Automatic speech recognition; Bayesian methods; Frequency; Hidden Markov models; Random variables; Spatial databases; Speech enhancement; Speech recognition; Switches;

fLanguage

English

Publisher

ieee

Conference_Titel

Pattern Recognition, 2006. ICPR 2006. 18th International Conference on

Conference_Location

Hong Kong

ISSN

1051-4651

Print_ISBN

0-7695-2521-0

Type

conf

DOI

10.1109/ICPR.2006.1098

Filename

1699829