DocumentCode :
33859
Title :
Data-Driven Scene Understanding with Adaptively Retrieved Exemplars
Author :
Xionghao Liu ; Wei Yang ; Liang Lin ; Qing Wang ; Zhaoquan Cai ; Jianhuang Lai
Author_Institution :
Sun Yat-sen Univ., Guangzhou, China
Volume :
22
Issue :
3
fYear :
2015
fDate :
July-Sept. 2015
Firstpage :
82
Lastpage :
92
Abstract :
This article investigates a data-driven approach for semantic scene understanding, without pixelwise annotation or classifier training. The proposed framework parses a target image in two steps: first, retrieving its exemplars (that is, references) from an image database, where all images are unsegmented but annotated with tags; second, recovering its pixel labels by propagating semantics from the references. The authors present a novel framework making the two steps mutually conditional and bootstrapped under the probabilistic Expectation-Maximization (EM) formulation. In the first step, the system selects the references by jointly matching the appearances as well as the semantics (that is, the assigned labels) with the target. They process the second step via a combinatorial graphical representation, in which the vertices are superpixels extracted from the target and its selected references. Then they derive the potentials of assigning labels to one vertex of the target, which depend upon the graph edges that connect the vertex to its spatial neighbors of the target and to similar vertices of the references. The proposed framework can be applied naturally to perform image annotation on new test images. In the experiments, the authors validated their approach on two public databases, and demonstrated superior performance over the state-of-the-art methods in both semantic segmentation and image annotation tasks.
Keywords :
expectation-maximisation algorithm; graph theory; image matching; image retrieval; probability; statistical analysis; visual databases; EM formulation; adaptively retrieved exemplars; appearance matching; assigned labels; bootstrapping; combinatorial graphical representation; data-driven approach; graph edges; image annotation task; image database; mutually conditional framework; pixel labels; probabilistic expectation-maximization formulation; public databases; semantic matching; semantic scene understanding; semantic segmentation task; spatial neighbors; super-pixel extracted vertices; target image parsing; unsegmented-annotated images; Computer graphics; Data analysis; Graphical models; Image edge detection; Image reconstruction; Image segmentation; Optimization; Semantics; data analysis; graphical model; graphics; image annotation; image retrieval; multimedia; scene understanding; semantic segmentation;
fLanguage :
English
Journal_Title :
MultiMedia, IEEE
Publisher :
ieee
ISSN :
1070-986X
Type :
jour
DOI :
10.1109/MMUL.2015.22
Filename :
7018682
Link To Document :
بازگشت