DocumentCode :
2492515
Title :
Object Recognition via Adaptive Multi-level Feature Integration
Author :
Wang, Mei ; Wu, Yanling ; Li, Guangda ; Zhou, Xiangdong
Author_Institution :
Sch. of Comput. Sci. & Technol., Donghua Univ., Shanghai, China
fYear :
2010
fDate :
6-8 April 2010
Firstpage :
253
Lastpage :
259
Abstract :
Object category recognition is a challenging task due to the low level and non-discrimination in visual representation. Most previous methods concentrate to find better high level visual features. Recently, optimally integrating various features to solve the problem attracted more interests. In this paper, we provide a novel method for object category recognition by improving the popular bag-of-words (BoW) methods from the following two aspects. First, we propose to extract a series of high level visual features which exploit both the local spatial co occurrence between low level visual words and the global spatial layout of the object parts. To obtain the global spatial features, a fast method is proposed to generate the semantic meaningful object parts by exploiting the geometric position distribution of the local salient regions. The image part patches are further quantized as semantic coherent high level visual words by using correlational spectral clustering. Based on it, simplified 2D string representation is introduced to model the global spatial patterns of the objects. Second, a multi-kernel learning framework is proposed to adaptively integrate extracted features in an optimal way. For each object class, an optimal feature weight coefficient is learned automatically and separately to combine both the low level and high level visual features by considering their contribution for the different object class. The tests on Caltech-101 and Pascal- VOC 06 dataset demonstrated that our method outperforms the baseline method BoW and state-of-the-art Multi-CM model .
Keywords :
feature extraction; image representation; learning (artificial intelligence); object recognition; pattern classification; pattern clustering; quantisation (signal); Caltech-101 dataset; Pascal-VOC 06 dataset; adaptive multilevel feature integration; bag-of-word methods; baseline BoW method; correlational spectral clustering; geometric position distribution; high level visual feature extraction; local salient regions; multikernel learning framework; object category recognition; optimal feature weight coefficient; semantic coherent high level visual word quantization; simplified 2D string representation; visual representation; Computer science; Data mining; Digital images; Feature extraction; Image databases; Information technology; Object recognition; Testing; Visual databases; Windows;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Conference (APWEB), 2010 12th International Asia-Pacific
Conference_Location :
Busan
Print_ISBN :
978-1-7695-4012-2
Electronic_ISBN :
978-1-4244-6600-9
Type :
conf
DOI :
10.1109/APWeb.2010.24
Filename :
5474128
Link To Document :
بازگشت