DocumentCode :
1418888
Title :
Scalable Active Learning for Multiclass Image Classification
Author :
Joshi, A.J. ; Porikli, F. ; Papanikolopoulos, N.P.
Author_Institution :
Google, Inc., Mountain View, CA, USA
Volume :
34
Issue :
11
fYear :
2012
Firstpage :
2259
Lastpage :
2273
Abstract :
Machine learning techniques for computer vision applications like object recognition, scene classification, etc., require a large number of training samples for satisfactory performance. Especially when classification is to be performed over many categories, providing enough training samples for each category is infeasible. This paper describes new ideas in multiclass active learning to deal with the training bottleneck, making it easier to train large multiclass image classification systems. First, we propose a new interaction modality for training which requires only yes-no type binary feedback instead of a precise category label. The modality is especially powerful in the presence of hundreds of categories. For the proposed modality, we develop a Value-of-Information (VOI) algorithm that chooses informative queries while also considering user annotation cost. Second, we propose an active selection measure that works with many categories and is extremely fast to compute. This measure is employed to perform a fast seed search before computing VOI, resulting in an algorithm that scales linearly with dataset size. Third, we use locality sensitive hashing to provide a very fast approximation to active learning, which gives sublinear time scaling, allowing application to very large datasets. The approximation provides up to two orders of magnitude speedups with little loss in accuracy. Thorough empirical evaluation of classification accuracy, noise sensitivity, imbalanced data, and computational performance on a diverse set of image datasets demonstrates the strengths of the proposed algorithms.
Keywords :
feedback; image classification; learning (artificial intelligence); VOI algorithm; active selection measure; category label; classification accuracy; computer vision; image dataset; informative query; interaction modality; locality sensitive hashing; machine learning technique; magnitude speedup; multiclass active learning; multiclass image classification; noise sensitivity; object recognition; scalable active learning; scene classification; sublinear time scaling; user annotation cost; value-of-information; yes-no type binary feedback; Accuracy; Couplings; Learning systems; Noise; Support vector machines; Training; Training data; Active learning; multiclass classification; object recognition; scalable machine learning; Algorithms; Artificial Intelligence; Image Enhancement; Image Interpretation, Computer-Assisted; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2012.21
Filename :
6127880
Link To Document :
بازگشت