DocumentCode :
3005663
Title :
Towards total scene understanding: Classification, annotation and segmentation in an automatic framework
Author :
Li-Jia Li ; Socher, Richard ; Li Fei-Fei
Author_Institution :
Dept. of Comput. Sci., Princeton Univ., Princeton, NJ, USA
fYear :
2009
fDate :
20-25 June 2009
Firstpage :
2036
Lastpage :
2043
Abstract :
Given an image, we propose a hierarchical generative model that classifies the overall scene, recognizes and segments each object component, as well as annotates the image with a list of tags. To our knowledge, this is the first model that performs all three tasks in one coherent framework. For instance, a scene of a `polo game´ consists of several visual objects such as `human´, `horse´, `grass´, etc. In addition, it can be further annotated with a list of more abstract (e.g. `dusk´) or visually less salient (e.g. `saddle´) tags. Our generative model jointly explains images through a visual model and a textual model. Visually relevant objects are represented by regions and patches, while visually irrelevant textual annotations are influenced directly by the overall scene class. We propose a fully automatic learning framework that is able to learn robust scene models from noisy Web data such as images and user tags from Flickr.com. We demonstrate the effectiveness of our framework by automatically classifying, annotating and segmenting images from eight classes depicting sport scenes. In all three tasks, our model significantly outperforms state-of-the-art algorithms.
Keywords :
image classification; image segmentation; object recognition; automatic framework; image annotation; image classification; image segmentation; object component recognition; polo game; sport scene; textual model; total scene understanding; visual model; visual object; Layout;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on
Conference_Location :
Miami, FL
ISSN :
1063-6919
Print_ISBN :
978-1-4244-3992-8
Type :
conf
DOI :
10.1109/CVPR.2009.5206718
Filename :
5206718
Link To Document :
بازگشت