• DocumentCode
    2289573
  • Title

    Discriminative models for multi-class object layout

  • Author

    Desai, Chaitanya ; Ramanan, Deva ; Fowlkes, Charless

  • Author_Institution
    Dept. of Comput. Sci., U C Irvine, Irvine, CA, USA
  • fYear
    2009
  • fDate
    Sept. 29 2009-Oct. 2 2009
  • Firstpage
    229
  • Lastpage
    236
  • Abstract
    Many state-of-the-art approaches for object recognition reduce the problem to a 0-1 classification task. Such reductions allow one to leverage sophisticated classifiers for learning. These models are typically trained independently for each class using positive and negative examples cropped from images. At test-time, various post-processing heuristics such as non-maxima suppression (NMS) are required to reconcile multiple detections within and between different classes for each image. Though crucial to good performance on benchmarks, this post-processing is usually defined heuristically. We introduce a unified model for multi-class object recognition that casts the problem as a structured prediction task. Rather than predicting a binary label for each image window independently, our model simultaneously predicts a structured labeling of the entire image. Our model learns statistics that capture the spatial arrangements of various object classes in real images, both in terms of which arrangements to suppress through NMS and which arrangements to favor through spatial co-occurrence statistics. We formulate parameter estimation in our model as a max-margin learning problem. Given training images with ground-truth object locations, we show how to formulate learning as a convex optimization problem. We employ a cutting plane algorithm similar to efficiently learn a model from thousands of training images. We show state-of-the-art results on the PASCAL VOC benchmark that indicate the benefits of learning a global model encapsulating the spatial layout of multiple object classes.
  • Keywords
    convex programming; image classification; learning (artificial intelligence); object recognition; parameter estimation; statistics; PASCAL VOC benchmark; convex optimization; cutting plane algorithm; max-margin learning; multiclass object recognition; nonmaxima suppression; parameter estimation; spatial co-occurrence statistics; Benchmark testing; Computer science; Detectors; Face detection; Image edge detection; Labeling; Object detection; Object recognition; Predictive models; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision, 2009 IEEE 12th International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1550-5499
  • Print_ISBN
    978-1-4244-4420-5
  • Electronic_ISBN
    1550-5499
  • Type

    conf

  • DOI
    10.1109/ICCV.2009.5459256
  • Filename
    5459256