• DocumentCode
    2954697
  • Title

    Recognizing jumbled images: The role of local and global information in image classification

  • Author

    Parikh, Devi

  • Author_Institution
    Toyota Technol. Inst., Chicago, IL, USA
  • fYear
    2011
  • fDate
    6-13 Nov. 2011
  • Firstpage
    519
  • Lastpage
    526
  • Abstract
    The performance of current state-of-the-art computer vision algorithms at image classification falls significantly short as compared to human abilities. To reduce this gap, it is important for the community to know what problems to solve, and not just how to solve them. Towards this goal, via the use of jumbled images, we strip apart two widely investigated aspects: local and global information in images, and identify the performance bottleneck. Interestingly, humans have been shown to reliably recognize jumbled images. The goal of our paper is to determine a functional model that mimics how humans recognize jumbled images i.e. exploit local information alone, and further evaluate if existing implementations of this computational model suffice to match human performance. Surprisingly, in our series of human studies and machine experiments, we find that a simple bag-of-words based majority-vote-like strategy is an accurate functional model of how humans recognize jumbled images. Moreover, a straightforward machine implementation of this model achieves accuracies similar to human subjects at classifying jumbled images. This indicates that perhaps existing machine vision techniques already leverage local information from images effectively, and future research efforts should be focused on more advanced modeling of global information.
  • Keywords
    computer vision; image classification; bag-of-words based majority-vote-like strategy; computational model; computer vision algorithms; functional model; global information; human abilities; image classification; jumbled image recognition; local information; machine vision techniques; Accuracy; Computational modeling; Face; Humans; Image recognition; Object recognition; Reliability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision (ICCV), 2011 IEEE International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1550-5499
  • Print_ISBN
    978-1-4577-1101-5
  • Type

    conf

  • DOI
    10.1109/ICCV.2011.6126283
  • Filename
    6126283