DocumentCode :
109940
Title :
Data-Driven Hierarchical Structure Kernel for Multiscale Part-Based Object Recognition
Author :
Botao Wang ; Hongkai Xiong ; Xiaoqian Jiang ; Zheng, Yuan F.
Author_Institution :
Dept. of Electron. Eng., Shanghai Jiao Tong Univ., Shanghai, China
Volume :
23
Issue :
4
fYear :
2014
fDate :
Apr-14
Firstpage :
1765
Lastpage :
1778
Abstract :
Detecting generic object categories in images and videos are a fundamental issue in computer vision. However, it faces the challenges from inter and intraclass diversity, as well as distortions caused by viewpoints, poses, deformations, and so on. To solve object variations, this paper constructs a structure kernel and proposes a multiscale part-based model incorporating the discriminative power of kernels. The structure kernel would measure the resemblance of part-based objects in three aspects: 1) the global similarity term to measure the resemblance of the global visual appearance of relevant objects; 2) the part similarity term to measure the resemblance of the visual appearance of distinctive parts; and 3) the spatial similarity term to measure the resemblance of the spatial layout of parts. In essence, the deformation of parts in the structure kernel is penalized in a multiscale space with respect to horizontal displacement, vertical displacement, and scale difference. Part similarities are combined with different weights, which are optimized efficiently to maximize the intraclass similarities and minimize the interclass similarities by the normalized stochastic gradient ascent algorithm. In addition, the parameters of the structure kernel are learned during the training process with regard to the distribution of the data in a more discriminative way. With flexible part sizes on scale and displacement, it can be more robust to the intraclass variations, poses, and viewpoints. Theoretical analysis and experimental evaluations demonstrate that the proposed multiscale part-based representation model with structure kernel exhibits accurate and robust performance, and outperforms state-of-the-art object classification approaches.
Keywords :
computer vision; displacement measurement; gradient methods; image representation; mechanical variables measurement; object detection; object recognition; stochastic processes; computer vision; data distribution; data-driven hierarchical structure kernel; discriminative kernel power; distinctive part visual appearance measurement; generic object category detection; global similarity term; horizontal displacement; interclass diversity; interclass similarity minimization; intraclass diversity; intraclass similarity maximization; multiscale part-based object recognition model; multiscale part-based representation model; normalized stochastic gradient ascent algorithm; object variations; part deformation penalization; part similarity term; part spatial layout resemblance measurement; part-based object resemblance measurement; relevant object global visual appearance resemblance measurement; scale difference; spatial similarity term; training process; vertical displacement; Feature extraction; Histograms; Kernel; Layout; Object recognition; Vectors; Visualization; Object recognition; feature extraction; multiscale part-based model; structure kernel; support vector machine;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2014.2307480
Filename :
6746174
Link To Document :
بازگشت