DocumentCode
3672131
Title
Is object localization for free? - Weakly-supervised learning with convolutional neural networks
Author
Maxime Oquab;Léon Bottou;Ivan Laptev;Josef Sivic
Author_Institution
INRIA Paris, France
fYear
2015
fDate
6/1/2015 12:00:00 AM
Firstpage
685
Lastpage
694
Abstract
Successful methods for visual object recognition typically rely on training datasets containing lots of richly annotated images. Detailed image annotation, e.g. by object bounding boxes, however, is both expensive and often subjective. We describe a weakly supervised convolutional neural network (CNN) for object classification that relies only on image-level labels, yet can learn from cluttered scenes containing multiple objects. We quantify its object classification and object location prediction performance on the Pascal VOC 2012 (20 object classes) and the much larger Microsoft COCO (80 object classes) datasets. We find that the network (i) outputs accurate image-level labels, (ii) predicts approximate locations (but not extents) of objects, and (iii) performs comparably to its fully-supervised counterparts using object bounding box annotation for training.
Keywords
"Training","Search problems","Visualization","Object recognition","Supervised learning","Neural networks","Computer architecture"
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2015.7298668
Filename
7298668
Link To Document