• DocumentCode
    3526770
  • Title

    Large scale natural image classification by sparsity exploration

  • Author

    Wang, Changhu ; Yan, Shuicheng ; Zhang, Hong-Jiang

  • Author_Institution
    MOE-MS Key Lab. of MCC, Univ. of Sci. & Technol. of China, Hefei
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    3709
  • Lastpage
    3712
  • Abstract
    We consider in this paper the problem of large scale natural image classification. As the explosion and popularity of images in the Internet, there are increasing attentions to utilize millions of or even billions of these images for helping image related research. Beyond the opportunities brought by unlimited data, a great challenge is how to design more effective classification methods under these large scale scenarios. Most of existing attempts are based on k-nearest-neighbor method. However, in spite of the optimistic performance in some tasks, this strategy still suffers from that, one single fixed global parameter k is not robust for different object classes from different semantic levels. In this paper, we propose an alternative method, called lscr1-nearest-neighbor, based on a sparse representation computed by lscr1-minimization. We first treat a testing sample as a sparse linear combination of all training samples, and then consider the related samples as the nearest neighbors of the testing sample. Finally, we classify the testing sample based on the majority of these neighbors´ classes. We conduct extensive experiments on a 1.6 million natural image database on different semantic levels defined based on WordNet, which demonstrate that the proposed lscr1-nearest-neighbor algorithm outperforms k-nearest-neighbor in two aspects: 1) the robustness of parameter selection for different semantic levels, and 2) the discriminative capability for large scale image classification task.
  • Keywords
    image classification; visual databases; Internet; k-nearest-neighbor method; large scale natural image classification; natural image database; Computer vision; Image classification; Image recognition; Information retrieval; Large-scale systems; Layout; Robustness; Signal processing; Signal processing algorithms; Testing; ℓ1-nearest-neighbor; Image classification; WordNet; k-nearest-neighbor; sparsity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960432
  • Filename
    4960432