DocumentCode
2291516
Title
Top-down color attention for object recognition
Author
Khan, Fahad Shahbaz ; Van de Weijer, Joost ; Vanrell, Maria
Author_Institution
Comput. Sci. Dept., Univ. Autonoma de Barcelona, Barcelona, Spain
fYear
2009
fDate
Sept. 29 2009-Oct. 2 2009
Firstpage
979
Lastpage
986
Abstract
Generally, the bag-of-words based image representation follows a bottom-up paradigm. The subsequent stages of the process (feature detection, feature description, vocabulary construction, and image representation) are performed independently of the intended object classes to be detected. In such a framework, combining multiple cues such as shape and color often provides below-expected results. This paper presents a novel method for recognizing object categories when using multiple cues by separating the shape and color cues. Color is used to guide attention by means of a top-down category-specific attention map. The color attention map is then further deployed to modulate the shape features by taking more features from regions within an image that are likely to contain an object instance. This procedure leads to a category-specific image histogram representation for each category. Furthermore, we argue that the method combines the advantages of both early and late fusion. We compare our approach with existing methods that combine color and shape cues on three data sets containing varied importance of both cues, namely, Soccer (color predominance), Flower (color and shape parity), and PASCAL VOC Challenge 2007 (shape predominance). The experiments clearly demonstrate that in all three data sets our proposed framework significantly outperforms the state-of-the-art methods for combining color and shape information.
Keywords
feature extraction; image colour analysis; image representation; object recognition; bag-of-words; bottom-up paradigm; category-specific image histogram; color cue; feature description; feature detection; image representation; multiple cues; object recognition; shape cue; shape features; top-down color attention; vocabulary construction; Computer science; Computer vision; Histograms; Image color analysis; Image representation; Information analysis; Object detection; Object recognition; Shape; Vocabulary;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Vision, 2009 IEEE 12th International Conference on
Conference_Location
Kyoto
ISSN
1550-5499
Print_ISBN
978-1-4244-4420-5
Electronic_ISBN
1550-5499
Type
conf
DOI
10.1109/ICCV.2009.5459362
Filename
5459362