• DocumentCode
    3745981
  • Title
    Fisher Encoded Convolutional Bag-of-Windows for Efficient Image Retrieval and Social Image Tagging
  • Author
    Tiberio Uricchio;Marco Bertini;Lorenzo Seidenari;Alberto Del Bimbo
  • Author_Institution
    MICC, Univ. di Firenze, Florence, Italy
  • fYear
    2015
  • Firstpage
    1020
  • Lastpage
    1026
  • Abstract
    In this paper, we present an efficient and accurate method to aggregate a set of Deep Convolutional Neural Network (CNN) responses extracted from a set of image windows. CNN features are usually computed on the whole frame or with a dense multi-scale approach. There is evidence that using multiple windows yields a better image representation; nonetheless, it is still not clear how windows should be sampled and how CNN responses should be aggregated. Instead of sampling the image densely in scale and space, we show that selecting a few hundred windows is enough to obtain an effective image signature. We show how to use Fisher Vectors and PCA to obtain a short and highly descriptive signature that can be used effectively for image retrieval. We test our method on two relevant computer vision tasks: image retrieval and image tagging. We report state-of-the-art results for both tasks on three standard datasets.
  • Keywords
    "Image retrieval","Feature extraction","Tagging","Principal component analysis","Proposals","Image representation","Computer vision"
  • Publisher
    IEEE
  • Conference_Titel
    Computer Vision Workshop (ICCVW), 2015 IEEE International Conference on
  • Type
    conf
  • DOI
    10.1109/ICCVW.2015.134
  • Filename
    7406483