• DocumentCode
    57866
  • Title

    Extracting 3D Layout From a Single Image Using Global Image Structures

  • Author

    Zhongyu Lou ; Gevers, Theo ; Ninghang Hu

  • Author_Institution
    Intell. Syst. Lab. Amsterdam, Univ. of Amsterdam, Amsterdam, Netherlands
  • Volume
    24
  • Issue
    10
  • fYear
    2015
  • fDate
    Oct. 2015
  • Firstpage
    3098
  • Lastpage
    3108
  • Abstract
    Extracting the pixel-level 3D layout from a single image is important for different applications, such as object localization, image, and video categorization. Traditionally, the 3D layout is derived by solving a pixel-level classification problem. However, the image-level 3D structure can be very beneficial for extracting pixel-level 3D layout since it implies the way how pixels in the image are organized. In this paper, we propose an approach that first predicts the global image structure, and then we use the global structure for fine-grained pixel-level 3D layout extraction. In particular, image features are extracted based on multiple layout templates. We then learn a discriminative model for classifying the global layout at the image-level. Using latent variables, we implicitly model the sublevel semantics of the image, which enrich the expressiveness of our model. After the image-level structure is obtained, it is used as the prior knowledge to infer pixel-wise 3D layout. Experiments show that the results of our model outperform the state-of-the-art methods by 11.7% for 3D structure classification. Moreover, we show that employing the 3D structure prior information yields accurate 3D scene layout segmentation.
  • Keywords
    feature extraction; image classification; image segmentation; 3D scene layout segmentation; discriminative model; fine-grained pixel-level 3D layout extraction; global image structure; image feature extraction; image sublevel semantics; image-level 3D structure classification; pixel-level classification problem; Feature extraction; Geometry; Graphical models; Image segmentation; Layout; Three-dimensional displays; Training data; 3D layout; Stage classification; Structural SVM; structural SVM;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2015.2431443
  • Filename
    7104156