DocumentCode
1236830
Title
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
Author
Torralba, Antonio ; Fergus, Rob ; Freeman, William T.
Author_Institution
Comput. Sci. & Artificial Intell. Lab., Massachusetts Inst. of Technol., Cambridge, MA
Volume
30
Issue
11
fYear
2008
Firstpage
1958
Lastpage
1970
Abstract
With the advent of the Internet, billions of images are now freely available online and constitute a dense sampling of the visual world. Using a variety of non-parametric methods, we explore this world with the aid of a large dataset of 79,302,017 images collected from the Internet. Motivated by psychophysical results showing the remarkable tolerance of the human visual system to degradations in image resolution, the images in the dataset are stored as 32 x 32 color images. Each image is loosely labeled with one of the 75,062 non-abstract nouns in English, as listed in the WordNet lexical database. Hence the image database gives comprehensive coverage of all object categories and scenes. The semantic information from WordNet can be used in conjunction with nearest-neighbor methods to perform object classification over a range of semantic levels, minimizing the effects of labeling noise. For certain classes that are particularly prevalent in the dataset, such as people, we are able to demonstrate a recognition performance comparable to class-specific Viola-Jones style detectors.
Keywords
Internet; image colour analysis; image recognition; image resolution; object detection; object recognition; WordNet; class-specific Viola-Jones style detectors; color images; human visual system; nearest-neighbor methods; nonparametric object recognition; nonparametric scene recognition; object classification; Computer vision; large datasets; Artificial Intelligence; Database Management Systems; Databases, Factual; Documentation; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Pattern Recognition, Automated
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
IEEE
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2008.128
Filename
4531741