• DocumentCode
    253538
  • Title

    Human vs. Computer in Scene and Object Recognition

  • Author

    Borji, Ali ; Itti, Laurent

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2014
  • fDate
    23-28 June 2014
  • Firstpage
    113
  • Lastpage
    120
  • Abstract
    Several decades of research in computer and primate vision have resulted in many models (some specialized for one problem, others more general) and invaluable experimental data. Here, to help focus research efforts onto the hardest unsolved problems, and bridge computer and human vision, we define a battery of 5 tests that measure the gap between human and machine performances in several dimensions (generalization across scene categories, generalization from images to edge maps and line drawings, invariance to rotation and scaling, local/global information with jumbled images, and object recognition performance). We measure model accuracy and the correlation between model and human error patterns. Experimenting over 7 datasets, where human data is available, and gauging 14 well-established models, we find that none fully resembles humans in all aspects, and we learn from each test which models and features are more promising in approaching humans in the tested dimension. Across all tests, we find that models based on local edge histograms consistently resemble humans more, while several scene statistics or "gist" models do perform well with both scenes and objects. While computer vision has long been inspired by human vision, we believe systematic efforts, such as this, will help better identify shortcomings of models and find new paths forward.
  • Keywords
    computer vision; image recognition; object recognition; bridge computer; computer vision; human error pattern; human vision; local edge histogram; local-global information; machine performance; object recognition performance; primate vision; Accuracy; Animals; Computational modeling; Histograms; Image color analysis; Object recognition; Sun; computer vision; human vision; jumbled images; line drawings; object recognition; scene recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on
  • Conference_Location
    Columbus, OH
  • Type

    conf

  • DOI
    10.1109/CVPR.2014.22
  • Filename
    6909416