DocumentCode :
3050403
Title :
Vision-based speaker detection using Bayesian networks
Author :
Rehg, James M. ; Murphy, Kevin P. ; Fieguth, Paul W.
Author_Institution :
Cambridge Res. Lab., Compaq Comput. Corp., MA, USA
Volume :
2
fYear :
1999
fDate :
1999
Abstract :
The development of user interfaces based on vision and speech requires the solution of a challenging statistical inference problem: The intentions and actions of multiple individuals must be inferred from noisy and ambiguous data. We argue that Bayesian network models are an attractive statistical framework for cue fusion in these applications. Bayes nets combine a natural mechanism for expressing contextual information with efficient algorithms for learning and inference. We illustrate these points through the development of a Bayes net model for detecting when a user is speaking. The model combines four simple vision sensors: face detection, skin color, skin texture, and mouth motion. We present some promising experimental results
Keywords :
belief networks; computer vision; user interfaces; Bayesian networks; speaker detection; user interfaces; vision sensors; Application software; Bayesian methods; Computer networks; Computer vision; Face detection; Inference algorithms; Skin; Speech; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 1999. IEEE Computer Society Conference on.
Conference_Location :
Fort Collins, CO
ISSN :
1063-6919
Print_ISBN :
0-7695-0149-4
Type :
conf
DOI :
10.1109/CVPR.1999.784617
Filename :
784617
Link To Document :
بازگشت