DocumentCode
3001944
Title
Optimal scanning for faster object detection
Author
Butko, Nicholas J ; Movellan, Javier R.
Author_Institution
Dept. of Cognitive Sci., UC San Diego, La Jolla, CA, USA
fYear
2009
fDate
20-25 June 2009
Firstpage
2751
Lastpage
2758
Abstract
Recent years have seen the development of fast and accurate algorithms for detecting objects in images. However, as the size of the scene grows, so do the running-times of these algorithms. If a 128×102 pixel image requires 20 ms to process, searching for objects in a 1280×1024 image will take 2 s. This is unsuitable under real-time operating constraints: by the time a frame has been processed, the object may have moved. An analogous problem occurs when controlling robot camera that need to scan scenes in search of target objects. In this paper, we consider a method for improving the run-time of general-purpose object-detection algorithms. Our method is based on a model of visual search in humans, which schedules eye fixations to maximize the long-term information accrued about the location of the target of interest. The approach can be used to drive robot cameras that physically scan scenes or to improve the scanning speed for very large high resolution images. We consider the latter application in this work by simulating a “digital fovea” and sequentially placing it in various regions of an image in a way that maximizes the expected information gain. We evaluate the approach using the OpenCV version of the Viola-Jones face detector. After accounting for all computational overhead introduced by the fixation controller, the approach doubles the speed of the standard Viola-Jones detector at little cost in accuracy.
Keywords
image resolution; object detection; OpenCV version; Viola-Jones face detector; digital fovea; general-purpose object-detection algorithms; image resolution; long-term information; optimal scanning; pixel image; robot camera; time 2 s; time 20 ms; visual search; Cameras; Detectors; Face detection; Layout; Object detection; Pixel; Robot control; Robot vision systems; Runtime; Time factors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
Conference_Location
Miami, FL
ISSN
1063-6919
Print_ISBN
978-1-4244-3992-8
Type
conf
DOI
10.1109/CVPR.2009.5206540
Filename
5206540
Link To Document