DocumentCode
3745981
Title
Fisher Encoded Convolutional Bag-of-Windows for Efficient Image Retrieval and Social Image Tagging
Author
Tiberio Uricchio;Marco Bertini;Lorenzo Seidenari;Alberto Del Bimbo
Author_Institution
MICC, Univ. di Firenze, Florence, Italy
fYear
2015
Firstpage
1020
Lastpage
1026
Abstract
In this paper we present an efficient and accurate method to aggregate a set of Deep Convolutional Neural Network (CNN) responses, extracted from a set of image windows. CNN features are usually computed on the whole frame or with a dense multi scale approach. There is evidence that using multiple windows yields a better image representation nonetheless it is still not clear how windows should be sampled and how CNN responses should be aggregated. Instead of sampling the image densely in scale and space we show that selecting a few hundred windows is enough to obtain an effective image signature. We show how to use Fisher Vectors and PCA to obtain a short and highly descriptive signature that can be used effectively for image retrieval. We test our method on two relevant computer vision tasks: image retrieval and image tagging. We report state-of-the art results for both tasks on three standard datasets.
Keywords
"Image retrieval","Feature extraction","Tagging","Principal component analysis","Proposals","Image representation","Computer vision"
Publisher
ieee
Conference_Titel
Computer Vision Workshop (ICCVW), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/ICCVW.2015.134
Filename
7406483
Link To Document