DocumentCode :
2718373
Title :
Meta-class features for large-scale object categorization on a budget
Author :
Bergamo, Alessandro ; Torresani, Lorenzo
Author_Institution :
Dartmouth Coll., Hanover, NH, USA
fYear :
2012
fDate :
16-21 June 2012
Firstpage :
3085
Lastpage :
3092
Abstract :
In this paper we introduce a novel image descriptor enabling accurate object categorization even with linear models. Akin to the popular attribute descriptors, our feature vector comprises the outputs of a set of classifiers evaluated on the image. However, unlike traditional attributes which represent hand-selected object classes and predefined visual properties, our features are learned automatically and correspond to “abstract” categories, which we name meta-classes. Each meta-class is a super-category obtained by grouping a set of object classes such that, collectively, they are easy to distinguish from other sets of categories. By using “learnability” of the meta-classes as criterion for feature generation, we obtain a set of attributes that encode general visual properties shared by multiple object classes and that are effective in describing and recognizing even novel categories, i.e., classes not present in the training set. We demonstrate that simple linear SVMs trained on our meta-class descriptor significantly outperform the best known classifier on the Caltech256 benchmark. We also present results on the 2010 ImageNet Challenge database where our system produces results approaching those of the best systems, but at a much lower computational cost.
Keywords :
feature extraction; image classification; learning (artificial intelligence); object recognition; support vector machines; 2010 ImageNet Challenge database; Caltech256 benchmark; abstract category; attribute descriptors; category description; category recognition; classifiers; feature generation; feature learning; feature vector; general visual properties encoding; image descriptor; large-scale object categorization; linear SVM; linear models; meta-class features; meta-class learnability; object class; Abstracts; Accuracy; Databases; Image recognition; Kernel; Training; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on
Conference_Location :
Providence, RI
ISSN :
1063-6919
Print_ISBN :
978-1-4673-1226-4
Electronic_ISBN :
1063-6919
Type :
conf
DOI :
10.1109/CVPR.2012.6248040
Filename :
6248040
Link To Document :
بازگشت