مرکز منطقه ای اطلاع رساني علوم و فناوري - Variational Bayes for continuous hidden Markov models and its application to active learning

DocumentCode :

834046

Title :

Variational Bayes for continuous hidden Markov models and its application to active learning

Author :

Ji, Shihao ; Krishnapuram, Balaji ; Carin, Lawrence

Author_Institution :

Dept. of Electr. & Comput. Eng., Duke Univ., Durham, NC, USA

Volume :

Issue :

fYear :

2006

fDate :

4/1/2006 12:00:00 AM

Firstpage :

522

Lastpage :

532

Abstract :

In this paper, we present a variational Bayes (VB) framework for learning continuous hidden Markov models (CHMMs), and we examine the VB framework within active learning. Unlike a maximum likelihood or maximum a posteriori training procedure, which yield a point estimate of the CHMM parameters, VB-based training yields an estimate of the full posterior of the model parameters. This is particularly important for small training sets since it gives a measure of confidence in the accuracy of the learned model. This is utilized within the context of active learning, for which we acquire labels for those feature vectors for which knowledge of the associated label would be most informative for reducing model-parameter uncertainty. Three active learning algorithms are considered in this paper: 1) query by committee (QBC), with the goal of selecting data for labeling that minimize the classification variance, 2) a maximum expected information gain method that seeks to label data with the goal of reducing the entropy of the model parameters, and 3) an error-reduction-based procedure that attempts to minimize classification error over the test data. The experimental results are presented for synthetic and measured data. We demonstrate that all of these active learning methods can significantly reduce the amount of required labeling, compared to random selection of samples for labeling.

Keywords :

Bayes methods; entropy; hidden Markov models; learning (artificial intelligence); maximum likelihood estimation; active learning; classification error minimization; classification variance minimization; continuous hidden Markov models; error-reduction-based procedure; maximum a posteriori training procedure; maximum expected information gain method; maximum likelihood training procedure; model parameter entropy reduction; model-parameter uncertainty reduction; point estimate; query by committee; variational Bayes; Bayesian methods; Context modeling; Hidden Markov models; Labeling; Maximum likelihood estimation; Parameter estimation; Particle measurements; Signal processing algorithms; Testing; Yield estimation; Variational Bayes (VB); active learning (AL); continuous hidden Markov models (CHMMs); error-reduction-based active learning.; maximum expected information gain (MEIG); query by committee (QBC); Algorithms; Artificial Intelligence; Bayes Theorem; Cluster Analysis; Computer Simulation; Information Storage and Retrieval; Markov Chains; Models, Statistical; Pattern Recognition, Automated;

fLanguage :

English

Journal_Title :

Pattern Analysis and Machine Intelligence, IEEE Transactions on

Publisher :

ieee

ISSN :

0162-8828

Type :

jour

DOI :

10.1109/TPAMI.2006.85

Filename :

1597110

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=834046