DocumentCode
62892
Title
Fast Feature Pyramids for Object Detection
Author
Dollar, Piotr ; Appel, Ron ; Belongie, Serge ; Perona, Pietro
Author_Institution
Interactive Visual Media Group, Microsoft Res., Redmond, WA, USA
Volume
36
Issue
8
fYear
2014
fDate
Aug. 2014
Firstpage
1532
Lastpage
1545
Abstract
Multi-resolution image features may be approximated via extrapolation from nearby scales, rather than being computed explicitly. This fundamental insight allows us to design object detection algorithms that are as accurate, and considerably faster, than the state-of-the-art. The computational bottleneck of many modern detectors is the computation of features at every scale of a finely-sampled image pyramid. Our key insight is that one may compute finely sampled feature pyramids at a fraction of the cost, without sacrificing performance: for a broad family of features we find that features computed at octave-spaced scale intervals are sufficient to approximate features on a finely-sampled pyramid. Extrapolation is inexpensive as compared to direct feature computation. As a result, our approximation yields considerable speedups with negligible loss in detection accuracy. We modify three diverse visual recognition systems to use fast feature pyramids and show results on both pedestrian detection (measured on the Caltech, INRIA, TUD-Brussels and ETH data sets) and general object detection (measured on the PASCAL VOC). The approach is general and is widely applicable to vision algorithms requiring fine-grained multi-scale analysis. Our approximation is valid for images with broad spectra (most natural images) and fails for images with narrow band-pass spectra (e.g., periodic textures).
Keywords
approximation theory; computer vision; extrapolation; feature extraction; object detection; object recognition; statistical analysis; Caltech data set; ETH data set; INRIA data set; PASCAL VOC data set; TUD-Brussels data set; approximation; broad spectra; detection accuracy; extrapolation; fast feature pyramids; fine-grained multiscale analysis; finely-sampled image pyramid; multiresolution image features; narrow band-pass spectra; object detection algorithms; octave-spaced scale intervals; pedestrian detection; vision algorithms; visual recognition systems; Accuracy; Approximation methods; Detectors; Feature extraction; Histograms; Object detection; Visualization; Visual features; image pyramids; natural image statistics; object detection; pedestrian detection; real-time systems;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2014.2300479
Filename
6714453
Link To Document