• DocumentCode
    784130
  • Title

    Person Surveillance Using Visual and Infrared Imagery

  • Author

    Krotosky, Stephen J. ; Trivedi, Mohan Manubhai

  • Author_Institution
    Adv. Multimedia & Signal Process. Div., Sci. Applic. Int. Corp. (SAIC), San Diego, CA
  • Volume
    18
  • Issue
    8
  • fYear
    2008
  • Firstpage
    1096
  • Lastpage
    1105
  • Abstract
    This paper presents a methodology for analyzing multimodal and multiperspective systems for person surveillance. Using an experimental testbed consisting of two color and two infrared cameras, we can accurately register the color and infrared imagery for any general scene configuration, expanding the scope of multispectral analysis beyond the specialized long-range surveillance experiments of previous approaches to more general scene configurations common to unimodal approaches. We design an algorithmic framework for detecting people in a scene that can be generalized to include color, infrared, and/or disparity features. Using a combination of a histogram of oriented gradient (HOG) feature-based support vector machine and size/depth-based constraints, we create a probabilistic score for evaluating the presence of people. Using this framework, we train person detectors using color stereo and infrared stereo features as well as tetravision-based detectors that combine the detector outputs from separately trained color stereo and infrared stereo-based detectors. Additionally, we incorporate the trifocal tensor in order to combine the color and infrared features in a unified detection framework and use these trained detectors for an experimental evaluation of video sequences captured with our designed testbed. Our evaluation definitively demonstrates the performance gains achievable when using the trifocal framework to combine color and infrared features in a unified framework. Both of the trifocal setups outperform their unimodal equivalents, as well as the tetravision-based analysis. Our experiments also demonstrate how the trained detector generalizes well to different scenes and can provide robust input to an additional tracking framework.
  • Keywords
    computer vision; feature extraction; gradient methods; image colour analysis; image registration; infrared imaging; learning (artificial intelligence); object detection; probability; spectral analysis; stereo image processing; support vector machines; surveillance; computer vision; image color analysis; image registration; infrared imagery; machine learning; multispectral analysis; object detection; oriented gradient feature extraction; person surveillance; probability; stereo image processing; support vector machine; trifocal tensor; video sequence; visual imagery;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems for Video Technology, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1051-8215
  • Type

    jour

  • DOI
    10.1109/TCSVT.2008.928217
  • Filename
    4559595