• DocumentCode
    3784923
  • Title

    Boosted learning in dynamic Bayesian networks for multimodal speaker detection

  • Author

    A. Garg;V. Pavlovic;J.M. Rehg

  • Author_Institution
    IBM Almaden Res. Center, San Jose, CA, USA
  • Volume
    91
  • Issue
    9
  • fYear
    2003
  • Firstpage
    1355
  • Lastpage
    1369
  • Abstract
    Bayesian network models provide an attractive framework for multimodal sensor fusion. They combine an intuitive graphical representation with efficient algorithms for inference and learning. However, the unsupervised nature of standard parameter learning algorithms for Bayesian networks can lead to poor performance in classification tasks. We have developed a supervised learning framework for Bayesian networks, which is based on the Adaboost algorithm of Schapire and Freund. Our framework covers static and dynamic Bayesian networks with both discrete and continuous states. We have tested our framework in the context of a novel multimodal HCI application: a speech-based command and control interface for a Smart Kiosk. We provide experimental evidence for the utility of our boosted learning approach.
  • Keywords
    "Intelligent networks","Bayesian methods","Multimodal sensors","Supervised learning","Testing","Human computer interaction","Command and control systems","Boosting","User interfaces","Speech"
  • Journal_Title
    Proceedings of the IEEE
  • Publisher
    ieee
  • ISSN
    0018-9219
  • Type

    jour

  • DOI
    10.1109/JPROC.2003.817119
  • Filename
    1230214