• DocumentCode
    3525949
  • Title
    Audio-assisted trajectory estimation in non-overlapping multi-camera networks
  • Author
    Taj, Murtaza ; Cavallaro, Andrea
  • Author_Institution
    Multimedia & Vision Group, Queen Mary Univ. of London, London
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    3517
  • Lastpage
    3520
  • Abstract
    We present an algorithm to improve trajectory estimation in networks of non-overlapping cameras using audio measurements. The algorithm fuses audio-visual cues in each camera's field of view and recovers trajectories in unobserved regions using microphones only. Audio source localization is performed using a stereo audio and cycloptic vision (STAC) sensor by estimating the time difference of arrival (TDOA) between a microphone pair and then by computing the cross-correlation. Audio estimates are then smoothed using Kalman filtering. The audio-visual fusion is performed using a dynamic weighting strategy. We show that using a multi-modal sensor with combined visual (narrow) and audio (wider) field of view can enable extended target tracking in non-overlapping camera settings. In particular, the weighting scheme improves performance in the overlapping regions. The algorithm is evaluated in several multi-sensor configurations using synthetic data and compared with a state-of-the-art algorithm.
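The abstract mentions TDOA estimation between a microphone pair via cross-correlation. The following is a minimal illustrative sketch of that general technique, not the authors' implementation. The function name `estimate_tdoa`, the 16 kHz sample rate, and the synthetic white-noise signal are all assumptions made for the example.

```python
import numpy as np

def estimate_tdoa(left, right, sample_rate):
    """Return the delay (in seconds) of `right` relative to `left`.

    A positive value means the sound reached the left microphone first.
    Hypothetical helper for illustration only.
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)
    # Full cross-correlation over all integer lags.
    corr = np.correlate(right, left, mode="full")
    # Index 0 of `corr` corresponds to lag -(len(left) - 1).
    lag = int(np.argmax(corr)) - (len(left) - 1)
    return lag / sample_rate

# Synthetic check: the right channel is the left channel delayed by 5 samples.
rng = np.random.default_rng(0)
signal = rng.standard_normal(1000)
delay_samples = 5
sample_rate = 16000  # assumed sample rate in Hz
left = signal
right = np.concatenate([np.zeros(delay_samples), signal[:-delay_samples]])

tdoa = estimate_tdoa(left, right, sample_rate)
print(f"Estimated TDOA: {tdoa * 1e3:.4f} ms")
```

In practice, recordings with reverberation or noise often call for a weighted cross-correlation and sub-sample peak interpolation; this sketch uses the plain integer-lag peak for clarity.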
  • Keywords
    Kalman filters; audio signal processing; image sensors; microphones; parameter estimation; sensor fusion; smoothing methods; target tracking; Kalman filtering; TDOA; audio measurements; audio source localization; audio-assisted trajectory estimation; audio-visual fusion; camera field of view; cross correlation; cycloptic vision sensor; dynamic weighting strategy; extended target tracking; microphones; multimodal sensor; multisensor configurations; nonoverlapping multicamera networks; smoothing method; stereo audio; time difference of arrival estimation; Acoustic sensors; Array signal processing; Cameras; Delay estimation; Microphones; Reverberation; Sensor fusion; Surveillance; Target tracking; Time difference of arrival; TDOA; fusion; tracking;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2009.4960384
  • Filename
    4960384