Title :
Fisher Encoded Convolutional Bag-of-Windows for Efficient Image Retrieval and Social Image Tagging
Author :
Tiberio Uricchio;Marco Bertini;Lorenzo Seidenari;Alberto Del Bimbo
Author_Institution :
MICC, Univ. di Firenze, Florence, Italy
Abstract :
In this paper we present an efficient and accurate method to aggregate a set of Deep Convolutional Neural Network (CNN) responses extracted from a set of image windows. CNN features are usually computed on the whole frame or with a dense multi-scale approach. There is evidence that using multiple windows yields a better image representation; nonetheless, it is still not clear how windows should be sampled and how CNN responses should be aggregated. Instead of sampling the image densely in scale and space, we show that selecting a few hundred windows is enough to obtain an effective image signature. We show how to use Fisher Vectors and PCA to obtain a short and highly descriptive signature that can be used effectively for image retrieval. We test our method on two relevant computer vision tasks: image retrieval and image tagging. We report state-of-the-art results for both tasks on three standard datasets.
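The aggregation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the standard improved Fisher Vector formulation (gradients with respect to the means and variances of a diagonal-covariance GMM, followed by power and L2 normalization) and applies PCA to the encoded vectors, an ordering the abstract does not specify. The function names (`fisher_vector`, `pca_fit`, `pca_project`), the random stand-in descriptors, and the hand-set GMM parameters are all hypothetical; in practice the descriptors would be CNN responses from the sampled windows and the GMM would be fitted with EM.

```python
import numpy as np

def fisher_vector(X, weights, means, variances):
    """Encode N descriptors X (N, D) with a K-component diagonal GMM.

    Returns a vector of length 2*K*D (mean and variance gradients),
    power-normalized and L2-normalized.
    """
    N = X.shape[0]
    sigma = np.sqrt(variances)                                # (K, D)
    diff = (X[:, None, :] - means[None, :, :]) / sigma[None]  # (N, K, D)

    # Log of the weighted Gaussian density for each descriptor/component pair.
    log_prob = (np.log(weights)[None, :]
                - 0.5 * np.sum(diff ** 2, axis=2)
                - 0.5 * np.sum(np.log(2 * np.pi * variances), axis=1)[None, :])
    log_prob -= log_prob.max(axis=1, keepdims=True)           # numerical stability
    post = np.exp(log_prob)
    post /= post.sum(axis=1, keepdims=True)                   # posteriors (N, K)

    # Gradients with respect to the GMM means and variances.
    g_mu = np.einsum('nk,nkd->kd', post, diff)
    g_mu /= N * np.sqrt(weights)[:, None]
    g_var = np.einsum('nk,nkd->kd', post, diff ** 2 - 1.0)
    g_var /= N * np.sqrt(2 * weights)[:, None]

    fv = np.concatenate([g_mu.ravel(), g_var.ravel()])
    fv = np.sign(fv) * np.sqrt(np.abs(fv))                    # power normalization
    norm = np.linalg.norm(fv)
    return fv / norm if norm > 0 else fv

def pca_fit(F, n_components):
    """Return the mean and the top principal directions of the rows of F."""
    mean = F.mean(axis=0)
    _, _, Vt = np.linalg.svd(F - mean, full_matrices=False)
    return mean, Vt[:n_components]

def pca_project(F, mean, components):
    """Project rows of F onto the principal directions."""
    return (F - mean) @ components.T

# Toy data: 10 "images", each with 200 window descriptors of dimension 8.
# Random values stand in for CNN responses (assumption for illustration).
rng = np.random.default_rng(0)
K, D = 4, 8
weights = np.full(K, 1.0 / K)          # hand-set GMM, not fitted (assumption)
means = rng.normal(size=(K, D))
variances = np.ones((K, D))
images = [rng.normal(size=(200, D)) for _ in range(10)]

signatures = np.stack([fisher_vector(X, weights, means, variances)
                       for X in images])                      # (10, 2*K*D)
mean, comps = pca_fit(signatures, n_components=5)
short = pca_project(signatures, mean, comps)                  # (10, 5)
```

Each image yields one fixed-length signature regardless of how many windows were sampled, which is what makes the encoding convenient for retrieval; the PCA step then shortens that signature.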
Keywords :
"Image retrieval","Feature extraction","Tagging","Principal component analysis","Proposals","Image representation","Computer vision"
Conference_Title :
Computer Vision Workshop (ICCVW), 2015 IEEE International Conference on
DOI :
10.1109/ICCVW.2015.134