• DocumentCode
    3001944
  • Title

    Optimal scanning for faster object detection

  • Author

    Butko, Nicholas J ; Movellan, Javier R.

  • Author_Institution
    Dept. of Cognitive Sci., UC San Diego, La Jolla, CA, USA
  • fYear
    2009
  • fDate
    20-25 June 2009
  • Firstpage
    2751
  • Lastpage
    2758
  • Abstract
    Recent years have seen the development of fast and accurate algorithms for detecting objects in images. However, as the size of the scene grows, so do the running-times of these algorithms. If a 128×102 pixel image requires 20 ms to process, searching for objects in a 1280×1024 image will take 2 s. This is unsuitable under real-time operating constraints: by the time a frame has been processed, the object may have moved. An analogous problem occurs when controlling robot camera that need to scan scenes in search of target objects. In this paper, we consider a method for improving the run-time of general-purpose object-detection algorithms. Our method is based on a model of visual search in humans, which schedules eye fixations to maximize the long-term information accrued about the location of the target of interest. The approach can be used to drive robot cameras that physically scan scenes or to improve the scanning speed for very large high resolution images. We consider the latter application in this work by simulating a “digital fovea” and sequentially placing it in various regions of an image in a way that maximizes the expected information gain. We evaluate the approach using the OpenCV version of the Viola-Jones face detector. After accounting for all computational overhead introduced by the fixation controller, the approach doubles the speed of the standard Viola-Jones detector at little cost in accuracy.
  • Keywords
    image resolution; object detection; OpenCV version; Viola-Jones face detector; digital fovea; general-purpose object-detection algorithms; image resolution; long-term information; optimal scanning; pixel image; robot camera; time 2 s; time 20 ms; visual search; Cameras; Detectors; Face detection; Layout; Object detection; Pixel; Robot control; Robot vision systems; Runtime; Time factors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
  • Conference_Location
    Miami, FL
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-3992-8
  • Type

    conf

  • DOI
    10.1109/CVPR.2009.5206540
  • Filename
    5206540