Title :
Improving an Object Detector and Extracting Regions Using Superpixels
Author :
Guang Shu ; Dehghan, Afshin ; Shah, Mubarak
Author_Institution :
Comput. Vision Lab., Univ. of Central Florida, Orlando, FL, USA
Abstract :
We propose an approach to improve the detection performance of a generic detector when it is applied to a particular video. The performance of offline-trained objects detectors are usually degraded in unconstrained video environments due to variant illuminations, backgrounds and camera viewpoints. Moreover, most object detectors are trained using Haar-like features or gradient features but ignore video specific features like consistent color patterns. In our approach, we apply a Super pixel-based Bag-of-Words (BoW) model to iteratively refine the output of a generic detector. Compared to other related work, our method builds a video-specific detector using super pixels, hence it can handle the problem of appearance variation. Most importantly, using Conditional Random Field (CRF) along with our super pixel-based BoW model, we develop and algorithm to segment the object from the background. Therefore our method generates an output of the exact object regions instead of the bounding boxes generated by most detectors. In general, our method takes detection bounding boxes of a generic detector as input and generates the detection output with higher average precision and precise object regions. The experiments on four recent datasets demonstrate the effectiveness of our approach and significantly improves the state-of-art detector by 5-16% in average precision.
Keywords :
Haar transforms; image colour analysis; image segmentation; object detection; random processes; video signal processing; BoW model; CRF; Haar-like features; appearance variation; color patterns; conditional random field; detection bounding boxes; generic detector; gradient features; object detectors; object segmentation; offline-trained objects detectors; region extraction; super pixel-based bag-of-words model; superpixels; unconstrained video environments; variant illuminations; video specific features; video-specific detector; Cameras; Computer vision; Detectors; Feature extraction; Image color analysis; Support vector machines; Training;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on
Conference_Location :
Portland, OR
DOI :
10.1109/CVPR.2013.477