Title :
Visual Synset: Towards a higher-level visual representation
Author :
Zheng, Yan-Tao ; Zhao, Ming ; Neo, Shi-Yong ; Chua, Tat-Seng ; Tian, Qi
Author_Institution :
Nat. Univ. of Singapore, Singapore
Abstract :
We present a higher-level visual representation, visual synset, for object categorization. The visual synset improves the traditional bag of words representation with better discrimination and invariance power. First, the approach strengthens the inter-class discrimination power by constructing an intermediate visual descriptor, delta visual phrase, from frequently co-occurring visual word-set with similar spatial context. Second, the approach achieves better intra-class invariance power, by clustering delta visual phrases into visual synset, based their probabilistic dasiasemanticspsila, i.e. class probability distribution. Hence, the resulting visual synset can partially bridge the visual differences of images of same class. The tests on Caltech-101 and Pascal-VOC 05 dataset demonstrated that the proposed image representation can achieve good accuracies.
Keywords :
document image processing; image classification; image representation; object recognition; pattern clustering; probability; text analysis; bag of words representation; class probability distribution; delta visual phrase; delta visual phrase clustering; document image processing; higher-level visual representation; probabilistic semantics; text analysis; visual descriptor; visual object categorization; visual object recognition; visual synset; visual word-set; Bicycles; Bridges; Image representation; Motorcycles; Object recognition; Probability distribution; Robustness; Solid modeling; Testing; Vector quantization;
Conference_Titel :
Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4244-2242-5
Electronic_ISBN :
1063-6919
DOI :
10.1109/CVPR.2008.4587611