• DocumentCode
    109940
  • Title

    Data-Driven Hierarchical Structure Kernel for Multiscale Part-Based Object Recognition

  • Author

    Botao Wang ; Hongkai Xiong ; Xiaoqian Jiang ; Zheng, Yuan F.

  • Author_Institution
    Dept. of Electron. Eng., Shanghai Jiao Tong Univ., Shanghai, China
  • Volume
    23
  • Issue
    4
  • fYear
    2014
  • fDate
    Apr-14
  • Firstpage
    1765
  • Lastpage
    1778
  • Abstract
    Detecting generic object categories in images and videos are a fundamental issue in computer vision. However, it faces the challenges from inter and intraclass diversity, as well as distortions caused by viewpoints, poses, deformations, and so on. To solve object variations, this paper constructs a structure kernel and proposes a multiscale part-based model incorporating the discriminative power of kernels. The structure kernel would measure the resemblance of part-based objects in three aspects: 1) the global similarity term to measure the resemblance of the global visual appearance of relevant objects; 2) the part similarity term to measure the resemblance of the visual appearance of distinctive parts; and 3) the spatial similarity term to measure the resemblance of the spatial layout of parts. In essence, the deformation of parts in the structure kernel is penalized in a multiscale space with respect to horizontal displacement, vertical displacement, and scale difference. Part similarities are combined with different weights, which are optimized efficiently to maximize the intraclass similarities and minimize the interclass similarities by the normalized stochastic gradient ascent algorithm. In addition, the parameters of the structure kernel are learned during the training process with regard to the distribution of the data in a more discriminative way. With flexible part sizes on scale and displacement, it can be more robust to the intraclass variations, poses, and viewpoints. Theoretical analysis and experimental evaluations demonstrate that the proposed multiscale part-based representation model with structure kernel exhibits accurate and robust performance, and outperforms state-of-the-art object classification approaches.
  • Keywords
    computer vision; displacement measurement; gradient methods; image representation; mechanical variables measurement; object detection; object recognition; stochastic processes; computer vision; data distribution; data-driven hierarchical structure kernel; discriminative kernel power; distinctive part visual appearance measurement; generic object category detection; global similarity term; horizontal displacement; interclass diversity; interclass similarity minimization; intraclass diversity; intraclass similarity maximization; multiscale part-based object recognition model; multiscale part-based representation model; normalized stochastic gradient ascent algorithm; object variations; part deformation penalization; part similarity term; part spatial layout resemblance measurement; part-based object resemblance measurement; relevant object global visual appearance resemblance measurement; scale difference; spatial similarity term; training process; vertical displacement; Feature extraction; Histograms; Kernel; Layout; Object recognition; Vectors; Visualization; Object recognition; feature extraction; multiscale part-based model; structure kernel; support vector machine;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2014.2307480
  • Filename
    6746174