• DocumentCode
    3748586
  • Title

    Scalable Nonlinear Embeddings for Semantic Category-Based Image Retrieval

  • Author

    Gaurav Sharma;Bernt Schiele

  • Author_Institution
    Max Planck Inst. for Inf., Saarbrυ
  • fYear
    2015
  • Firstpage
    1296
  • Lastpage
    1304
  • Abstract
    We propose a novel algorithm for the task of supervised discriminative distance learning by nonlinearly embedding vectors into a low dimensional Euclidean space. We work in the challenging setting where supervision is with constraints on similar and dissimilar pairs while training. The proposed method is derived by an approximate kernelization of a linear Mahalanobis-like distance metric learning algorithm and can also be seen as a kernel neural network. The number of model parameters and test time evaluation complexity of the proposed method are O(dD) where D is the dimensionality of the input features and d is the dimension of the projection space -- this is in contrast to the usual kernelization methods as, unlike them, the complexity does not scale linearly with the number of training examples. We propose a stochastic gradient based learning algorithm which makes the method scalable (w.r.t. the number of training examples), while being nonlinear. We train the method with up to half a million training pairs of 4096 dimensional CNN features. We give empirical comparisons with relevant baselines on seven challenging datasets for the task of low dimensional semantic category based image retrieval.
  • Keywords
    "Measurement","Kernel","Training","Computer vision","Complexity theory","Approximation algorithms","Neural networks"
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision (ICCV), 2015 IEEE International Conference on
  • Electronic_ISBN
    2380-7504
  • Type

    conf

  • DOI
    10.1109/ICCV.2015.153
  • Filename
    7410510