مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised learning of high-order structural semantics from images

DocumentCode :

2293481

Title :

Unsupervised learning of high-order structural semantics from images

Author :

Gao, Jizhou ; Hu, Yin ; Liu, Jinze ; Yang, Ruigang

Author_Institution :

Center for Visualization & Virtual Environments, Univ. of Kentucky, Lexington, KY, USA

fYear :

2009

fDate :

Sept. 29 2009-Oct. 2 2009

Firstpage :

2122

Lastpage :

2129

Abstract :

Structural semantics are fundamental to understanding both natural and man-made objects from languages to buildings. They are manifested as repeated structures or patterns and are often captured in images. Finding repeated patterns in images, therefore, has important applications in scene understanding, 3D reconstruction, and image retrieval as well as image compression. Previous approaches in visual-pattern mining limited themselves by looking for frequently co-occurring features within a small neighborhood in an image. However, semantics of a visual pattern are typically defined by specific spatial relationships between features regardless of the spatial proximity. In this paper, semantics are represented as visual elements and geometric relationships between them. A novel unsupervised learning algorithm finds pair-wise associations of visual elements that have consistent geometric relationships sufficiently often. The algorithms are efficient - maximal matchings are determined without combinatorial search. High-order structural semantics are extracted by mining patterns that are composed of pairwise spatially consistent associations of visual elements. We demonstrate the effectiveness of our approach for discovering repeated visual patterns on a variety of image collections.

Keywords :

image matching; image reconstruction; image retrieval; semantic networks; unsupervised learning; 3D image reconstruction; efficient maximal matchings; high-order structural semantics; image collections; image compression; image retrieval; man-made objects; scene understanding; spatial proximity; unsupervised learning; visual-pattern mining; Buildings; Costs; Eyes; Image retrieval; Layout; Polynomials; Unsupervised learning; Virtual environment; Visualization; Windows;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision, 2009 IEEE 12th International Conference on

Conference_Location :

Kyoto

ISSN :

1550-5499

Print_ISBN :

978-1-4244-4420-5

Electronic_ISBN :

1550-5499

Type :

conf

DOI :

10.1109/ICCV.2009.5459465

Filename :

5459465

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2293481