Improving an Object Detector and Extracting Regions Using Superpixels

Author

Guang Shu ; Dehghan, Afshin ; Shah, Mubarak

Author_Institution

Comput. Vision Lab., Univ. of Central Florida, Orlando, FL, USA

fYear

2013

fDate

23-28 June 2013

Firstpage

3721

Lastpage

3727

Abstract

We propose an approach to improve the detection performance of a generic detector when it is applied to a particular video. The performance of offline-trained objects detectors are usually degraded in unconstrained video environments due to variant illuminations, backgrounds and camera viewpoints. Moreover, most object detectors are trained using Haar-like features or gradient features but ignore video specific features like consistent color patterns. In our approach, we apply a Super pixel-based Bag-of-Words (BoW) model to iteratively refine the output of a generic detector. Compared to other related work, our method builds a video-specific detector using super pixels, hence it can handle the problem of appearance variation. Most importantly, using Conditional Random Field (CRF) along with our super pixel-based BoW model, we develop and algorithm to segment the object from the background. Therefore our method generates an output of the exact object regions instead of the bounding boxes generated by most detectors. In general, our method takes detection bounding boxes of a generic detector as input and generates the detection output with higher average precision and precise object regions. The experiments on four recent datasets demonstrate the effectiveness of our approach and significantly improves the state-of-art detector by 5-16% in average precision.

Keywords

Haar transforms; image colour analysis; image segmentation; object detection; random processes; video signal processing; BoW model; CRF; Haar-like features; appearance variation; color patterns; conditional random field; detection bounding boxes; generic detector; gradient features; object detectors; object segmentation; offline-trained objects detectors; region extraction; super pixel-based bag-of-words model; superpixels; unconstrained video environments; variant illuminations; video specific features; video-specific detector; Cameras; Computer vision; Detectors; Feature extraction; Image color analysis; Support vector machines; Training;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer Vision and Pattern Recognition (CVPR), 2013 IEEE Conference on

Conference_Location

Portland, OR

ISSN

1063-6919

Type

conf

DOI

10.1109/CVPR.2013.477

Filename

6619321