• DocumentCode
    61674
  • Title

    Learning Hierarchical Features for Scene Labeling

  • Author

    Farabet, Clement ; Couprie, C. ; Najman, Laurent ; LeCun, Yann

  • Author_Institution
    Courant Inst. of Math. Sci., New York Univ., New York, NY, USA
  • Volume
    35
  • Issue
    8
  • fYear
    2013
  • fDate
    Aug. 2013
  • Firstpage
    1915
  • Lastpage
    1929
  • Abstract
    Scene labeling consists of labeling each pixel in an image with the category of the object it belongs to. We propose a method that uses a multiscale convolutional network trained from raw pixels to extract dense feature vectors that encode regions of multiple sizes centered on each pixel. The method alleviates the need for engineered features, and produces a powerful representation that captures texture, shape, and contextual information. We report results using multiple postprocessing methods to produce the final labeling. Among those, we propose a technique to automatically retrieve, from a pool of segmentation components, an optimal set of components that best explain the scene; these components are arbitrary, for example, they can be taken from a segmentation tree or from any family of oversegmentations. The system yields record accuracies on the SIFT Flow dataset (33 classes) and the Barcelona dataset (170 classes) and near-record accuracy on Stanford background dataset (eight classes), while being an order of magnitude faster than competing approaches, producing a 320×240 image labeling in less than a second, including feature extraction.
  • Keywords
    feature extraction; image classification; image segmentation; image texture; shape recognition; transforms; trees (mathematics); Barcelona dataset; SIFT flow dataset; Stanford background dataset; contextual information capturing; dense feature vector extraction; hierarchical feature learning; image labeling; image pixel labeling; multiple size region encoding; multiscale convolutional network; near-record accuracy; object category; scene labeling; segmentation components; segmentation tree; shape information capturing; texture information capturing; Accuracy; Context; Feature extraction; Image edge detection; Image segmentation; Labeling; Vectors; Convolutional networks; deep learning; image classification; image segmentation; scene parsing;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2012.231
  • Filename
    6338939