مرکز منطقه ای اطلاع رساني علوم و فناوري - Boosting and structure learning in dynamic Bayesian networks for audio-visual speaker detection

DocumentCode :

3614098

Title :

Boosting and structure learning in dynamic Bayesian networks for audio-visual speaker detection

Author :

T. Choudhury;J.M. Rehg;V. Pavlovic;A. Pentland

Author_Institution :

Media Lab., MIT, Cambridge, MA, USA

Volume :

fYear :

2002

fDate :

6/24/1905 12:00:00 AM

Firstpage :

789

Abstract :

Bayesian networks are an attractive modeling tool for human sensing, as they combine an intuitive graphical representation with efficient algorithms for inference and learning. Earlier work has demonstrated that boosted parameter learning could be used to improve the performance of Bayesian network, classifiers for complex multi-modal inference problems such as speaker detection. In speaker detection, the goal is to use video and audio cites to infer when a person is speaking to a user interface. In this paper we introduce a new boosted structure learning algorithm based on AdaBoost. Given labeled data, our algorithm modifies both the network structure and parameters so as to improve classification accuracy. We compare its performance to both standard structure learning and boosted parameter learning on a fixed structure. We present results for speaker detection and for the UCI "chess" dataset.

Keywords :

"Boosting","Intelligent networks","Bayesian methods","Speech","Testing","Lips","Laboratories","Educational institutions","Computer networks","Petroleum"

Publisher :

ieee

Conference_Titel :

Pattern Recognition, 2002. Proceedings. 16th International Conference on

ISSN :

1051-4651

Print_ISBN :

0-7695-1695-X

Type :

conf

DOI :

10.1109/ICPR.2002.1048137

Filename :

1048137

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3614098