Title :
Learning Hierarchical Space Tiling for Scene Modeling, Parsing and Attribute Tagging
Author :
Shuo Wang ; Yizhou Wang ; Song-Chun Zhu
Author_Institution :
Dept. of EECS, Peking Univ., Beijing, China
Abstract :
A typical scene category contains an enormous number of distinct scene configurations that are composed of objects and regions of varying shapes in different layouts. In this paper, we first propose a representation named hierarchical space tiling (HST) to quantize the huge and continuous scene configuration space. Then, we augment the HST with attributes (nouns and adjectives) to describe the semantics of the objects and regions inside a scene. We present a weakly supervised method for simultaneously learning the scene configurations and attributes from a collection of natural images associated with descriptive text. The precise locations of attributes are unknown in the input and are mapped to the HST nodes through learning. Starting with a full HST, we iteratively estimate the HST model under a learning-by-parsing framework. Given a test image, we compute the most probable parse tree with the associated attributes by dynamic programming. We quantitatively analyze the representational efficiency of HST and show that the learned representation is less ambiguous and has semantically meaningful inner concepts. We apply our model to four tasks (scene classification, attribute recognition, attribute localization, and pixel-wise scene labeling) and show performance improvements as well as higher efficiency.
Keywords :
attribute grammars; dynamic programming; learning (artificial intelligence); program compilers; trees (mathematics); HST nodes; adjectives; attribute localization; attribute recognition; attribute tagging; continuous scene configuration space; distinct scene configurations; hierarchical space tiling learning; learning-by-parsing framework; natural images; nouns; parse tree; pixel-wise scene labeling; scene modeling; weakly supervised method; Cloud computing; Computational modeling; Image segmentation; Scene representation; Semantics; Shape analysis; Hierarchical Space Tiling; Scene Attributes
Journal_Title :
IEEE Transactions on Pattern Analysis and Machine Intelligence
DOI :
10.1109/TPAMI.2015.2424880