Title :
The role of features, algorithms and data in visual recognition
Author :
Parikh, Devi ; Zitnick, C. Lawrence
Author_Institution :
Toyota Technol. Inst., Chicago (TTIC), Chicago, IL, USA
Abstract :
There are many computer vision algorithms developed for visual (scene and object) recognition. Some systems focus on involved learning algorithms, some leverage millions of training images, and some systems focus on modeling relevant information (features) with the goal of effective recognition. However, none of these systems come close to human capabilities. If we study human responses on similar problems we could gain insight into which of the three factors (1) learning algorithm (2) amount of training data and (3) features is critical to humans´ superior performance. In this work we take a small step towards this goal by performing a series of human studies and machine experiments. We find no evidence that human pattern matching algorithms are better than standard machine learning algorithms. Moreover, we find that humans don´t leverage increased amounts of training data. Through statistical analysis on the machine experiments and supporting human studies, we find that the main factor impacting accuracies is the choice of features.
Keywords :
computer vision; feature extraction; image recognition; object recognition; statistical analysis; human pattern matching algorithms; machine learning algorithms; object recognition; training images; visual recognition; Analysis of variance; Computer vision; Feature extraction; Humans; Image recognition; Layout; Machine learning; Machine learning algorithms; Pattern matching; Training data;
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-4244-6984-0
DOI :
10.1109/CVPR.2010.5539920