Title :
Tiny Videos: A Large Data Set for Nonparametric Video Retrieval and Frame Classification
Author :
Karpenko, Alexandre ; Aarabi, Parham
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada
fDate :
3/1/2011 12:00:00 AM
Abstract :
In this paper, we present a large database of over 50,000 user-labeled videos collected from YouTube. We develop a compact representation called “tiny videos” that achieves high video compression rates while retaining the overall visual appearance of the video as it varies over time. We show that frame sampling using affinity propagation - an exemplar-based clustering algorithm - achieves the best trade-off between compression and video recall. We use this large collection of user-labeled videos in conjunction with simple data mining techniques to perform related video retrieval, as well as classification of images and video frames. The classification results achieved by tiny videos are compared with the tiny images framework for a variety of recognition tasks. The tiny images data set consists of 80 million images collected from the Internet. These are the largest labeled research data sets of videos and images available to date. We show that tiny videos are better suited for classifying scenery and sports activities, while tiny images perform better at recognizing objects. Furthermore, we demonstrate that combining the tiny images and tiny videos data sets improves classification precision in a wider range of categories.
Keywords :
data compression; data mining; image classification; image representation; pattern clustering; sampling methods; video coding; video retrieval; Internet; YouTube; affinity propagation; data mining techniques; exemplar based clustering algorithm; frame sampling; high video compression rates; nonparametric video retrieval; tiny videos; user labeled videos; video frames classification; video recall; visual appearance; Clustering algorithms; Data mining; Image coding; Image recognition; Image retrieval; Image sampling; Information retrieval; Video compression; Visual databases; YouTube; Image classification; content-based retrieval; data mining; nearest-neighbor methods; tiny images; tiny videos; Algorithms; Artificial Intelligence; Cluster Analysis; Database Management Systems; Databases, Factual; Fractals; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Internet; Pattern Recognition, Automated; Video Recording; Videotape Recording;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2010.118