• DocumentCode
    3422062
  • Title

    Deformable Part Descriptors for Fine-Grained Recognition and Attribute Prediction

  • Author

    Ning Zhang ; Farrell, Ronan ; Iandola, Forrest ; Darrell, Trevor

  • Author_Institution
    ICSI / UC Berkeley, Berkeley, CA, USA
  • fYear
    2013
  • fDate
    1-8 Dec. 2013
  • Firstpage
    729
  • Lastpage
    736
  • Abstract
    Recognizing objects in fine-grained domains can be extremely challenging due to the subtle differences between subcategories. Discriminative markings are often highly localized, leading traditional object recognition approaches to struggle with the large pose variation often present in these domains. Pose-normalization seeks to align training exemplars, either piecewise by part or globally for the whole object, effectively factoring out differences in pose and in viewing angle. Prior approaches relied on computationally-expensive filter ensembles for part localization and required extensive supervision. This paper proposes two pose-normalized descriptors based on computationally-efficient deformable part models. The first leverages the semantics inherent in strongly-supervised DPM parts. The second exploits weak semantic annotations to learn cross-component correspondences, computing pose-normalized descriptors from the latent parts of a weakly-supervised DPM. These representations enable pooling across pose and viewpoint, in turn facilitating tasks such as fine-grained recognition and attribute prediction. Experiments conducted on the Caltech-UCSD Birds 200 dataset and Berkeley Human Attribute dataset demonstrate significant improvements of our approach over state-of-art algorithms.
  • Keywords
    learning (artificial intelligence); object recognition; pose estimation; Berkeley human attribute dataset; Caltech-UCSD Birds 200 dataset; DPM parts; attribute prediction; cross-component correspondence learning; deformable part descriptors; discriminative marking; fine-grained recognition; object recognition; part localization; pose variation; pose-normalization; pose-normalized descriptors; training exemplar alignment; weak semantic annotations; Birds; Equations; Feature extraction; Head; Mathematical model; Semantics; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision (ICCV), 2013 IEEE International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1550-5499
  • Type

    conf

  • DOI
    10.1109/ICCV.2013.96
  • Filename
    6751200