DocumentCode :
35702
Title :
Large-Scale Weakly Supervised Object Localization via Latent Category Learning
Author :
Chong Wang ; Kaiqi Huang ; Weiqiang Ren ; Junge Zhang ; Maybank, Steve
Author_Institution :
Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
Volume :
24
Issue :
4
fYear :
2015
fDate :
Apr-15
Firstpage :
1371
Lastpage :
1385
Abstract :
Localizing objects in cluttered backgrounds is challenging under large-scale weakly supervised conditions. Due to the cluttered image condition, objects usually have large ambiguity with backgrounds. Besides, there is also a lack of effective algorithm for large-scale weakly supervised localization in cluttered backgrounds. However, backgrounds contain useful latent information, e.g., the sky in the aeroplane class. If this latent information can be learned, object-background ambiguity can be largely reduced and background can be suppressed effectively. In this paper, we propose the latent category learning (LCL) in large-scale cluttered conditions. LCL is an unsupervised learning method which requires only image-level class labels. First, we use the latent semantic analysis with semantic object representation to learn the latent categories, which represent objects, object parts or backgrounds. Second, to determine which category contains the target object, we propose a category selection strategy by evaluating each category´s discrimination. Finally, we propose the online LCL for use in large-scale conditions. Evaluation on the challenging PASCAL Visual Object Class (VOC) 2007 and the large-scale imagenet large-scale visual recognition challenge 2013 detection data sets shows that the method can improve the annotation precision by 10% over previous methods. More importantly, we achieve the detection precision which outperforms previous results by a large margin and can be competitive to the supervised deformable part model 5.0 baseline on both data sets.
Keywords :
computer vision; image representation; unsupervised learning; ImageNet Large Scale Visual Recognition Challenge; PASCAL Visual Object Class 2007; annotation precision improvement; category discrimination evaluation; category selection strategy; cluttered background suppression; cluttered image condition; image-level class labels; large-scale cluttered conditions; large-scale weakly supervised object localization; latent category learning; latent semantic analysis; object-background ambiguity reduction; online LCL; semantic object representation; unsupervised learning method; Feature extraction; Histograms; Proposals; Search problems; Semantics; Training; Visualization; Weakly supervised learning; analysis; large-scale; latent semantic; latent semantic analysis; object localization; weakly supervised learning;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2015.2396361
Filename :
7021890
Link To Document :
بازگشت