DocumentCode
639579
Title
Improving an Object Detector and Extracting Regions Using Superpixels
Author
Guang Shu ; Dehghan, Afshin ; Shah, Mubarak
Author_Institution
Comput. Vision Lab., Univ. of Central Florida, Orlando, FL, USA
fYear
2013
fDate
23-28 June 2013
Firstpage
3721
Lastpage
3727
Abstract
We propose an approach to improve the detection performance of a generic detector when it is applied to a particular video. The performance of offline-trained objects detectors are usually degraded in unconstrained video environments due to variant illuminations, backgrounds and camera viewpoints. Moreover, most object detectors are trained using Haar-like features or gradient features but ignore video specific features like consistent color patterns. In our approach, we apply a Super pixel-based Bag-of-Words (BoW) model to iteratively refine the output of a generic detector. Compared to other related work, our method builds a video-specific detector using super pixels, hence it can handle the problem of appearance variation. Most importantly, using Conditional Random Field (CRF) along with our super pixel-based BoW model, we develop and algorithm to segment the object from the background. Therefore our method generates an output of the exact object regions instead of the bounding boxes generated by most detectors. In general, our method takes detection bounding boxes of a generic detector as input and generates the detection output with higher average precision and precise object regions. The experiments on four recent datasets demonstrate the effectiveness of our approach and significantly improves the state-of-art detector by 5-16% in average precision.
Keywords
Haar transforms; image colour analysis; image segmentation; object detection; random processes; video signal processing; BoW model; CRF; Haar-like features; appearance variation; color patterns; conditional random field; detection bounding boxes; generic detector; gradient features; object detectors; object segmentation; offline-trained objects detectors; region extraction; super pixel-based bag-of-words model; superpixels; unconstrained video environments; variant illuminations; video specific features; video-specific detector; Cameras; Computer vision; Detectors; Feature extraction; Image color analysis; Support vector machines; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location
Portland, OR
ISSN
1063-6919
Type
conf
DOI
10.1109/CVPR.2013.477
Filename
6619321
Link To Document