DocumentCode
1172796
Title
Graphical model architectures for speech recognition
Author
Bilmes, Jeff A. ; Bartels, Chris
Volume
22
Issue
5
fYear
2005
Firstpage
89
Lastpage
100
Abstract
This article discusses the foundations of the use of graphical models for speech recognition as presented in J. R. Deller et al. (1993), X. D. Huang et al. (2001), F. Jelinek (19970, L. R. Rabiner and B. -H. Juang (1993) and S. Young et al. (1990) giving detailed accounts of some of the more successful cases. Our discussion employs dynamic Bayesian networks (DBNs) and a DBN extension using the Graphical Model Toolkit´s (GMTK´s) basic template, a dynamic graphical model representation that is more suitable for speech and language systems. While this article concentrates on speech recognition, it should be noted that many of the ideas presented here are also applicable to natural language processing and general time-series analysis.
Keywords
belief networks; graph theory; natural languages; speech recognition; time series; dynamic Bayesian networks; dynamic graphical model representation; language systems; natural language processing; speech recognition; time-series analysis; Bayesian methods; Computer architecture; Computer science; Graphical models; Natural languages; Probability distribution; Random variables; Social network services; Speech processing; Speech recognition;
fLanguage
English
Journal_Title
Signal Processing Magazine, IEEE
Publisher
ieee
ISSN
1053-5888
Type
jour
DOI
10.1109/MSP.2005.1511827
Filename
1511827
Link To Document