• DocumentCode
    3672370
  • Title

    Evaluation of output embeddings for fine-grained image classification

  • Author

    Zeynep Akata;Scott Reed;Daniel Walter; Honglak Lee;Bernt Schiele

  • Author_Institution
    Computer Vision and Multimodal Computing, Max Planck Institute for Informatics, Saarbrucken, Germany
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Firstpage
    2927
  • Lastpage
    2936
  • Abstract
    Image classification has advanced significantly in recent years with the availability of large-scale image sets. However, fine-grained classification remains a major challenge due to the annotation cost of large numbers of fine-grained categories. This project shows that compelling classification performance can be achieved on such categories even without labeled training data. Given image and class embeddings, we learn a compatibility function such that matching embeddings are assigned a higher score than mismatching ones; zero-shot classification of an image proceeds by finding the label yielding the highest joint compatibility score. We use state-of-the-art image features and focus on different supervised attributes and unsupervised output embeddings either derived from hierarchies or learned from unlabeled text corpora. We establish a substantially improved state-of-the-art on the Animals with Attributes and Caltech-UCSD Birds datasets. Most encouragingly, we demonstrate that purely unsupervised output embeddings (learned from Wikipedia and improved with finegrained text) achieve compelling results, even outperforming the previous supervised state-of-the-art. By combining different output embeddings, we further improve results.
  • Keywords
    "Context","Joints","Encyclopedias","Electronic publishing","Internet","Vocabulary"
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2015.7298911
  • Filename
    7298911