• DocumentCode
    2163633
  • Title

    Bayesian integration of audio and visual information for multi-target tracking using a CB-member filter

  • Author

    Hoseinnezhad, Reza ; Vo, Ba-Ngu ; Vo, Ba-Tuong ; Suter, David

  • Author_Institution
    RMIT Univ., Melbourne, VIC, Australia
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    2300
  • Lastpage
    2303
  • Abstract
    A new method is presented for integration of audio and visual information in multiple target tracking applications. The proposed approach uses a Bayesian filtering formulation and exploits multi-Bernoulli random finite set approximations. The work presented in this paper is the first principled Bayesian estimation approach to solve the sensor fusion problems that involve intermittent sensory data (e.g. audio data for a person who occasionally speaks.) We have examined our method with case studies from the SPEVI database. The results show nearly perfect tracking of people not only when they are silent but also when they are not visible to the camera (but speaking).
  • Keywords
    approximation theory; belief networks; filtering theory; sensor fusion; target tracking; Bayesian filtering formulation; Bayesian integration; CB-member filter; SPEVI database; audio information; multi Bernoulli random finite set approximation; multitarget tracking; sensor fusion; visual information; Acoustic measurements; Bayesian methods; Cameras; Predictive models; Target tracking; Visualization; Bayesian filtering; audio-visual tracking; finite set statistics; random finite sets; sensor fusion;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946942
  • Filename
    5946942