DocumentCode :
2293481
Title :
Unsupervised learning of high-order structural semantics from images
Author :
Gao, Jizhou ; Hu, Yin ; Liu, Jinze ; Yang, Ruigang
Author_Institution :
Center for Visualization & Virtual Environments, Univ. of Kentucky, Lexington, KY, USA
fYear :
2009
fDate :
Sept. 29 2009-Oct. 2 2009
Firstpage :
2122
Lastpage :
2129
Abstract :
Structural semantics are fundamental to understanding both natural and man-made objects from languages to buildings. They are manifested as repeated structures or patterns and are often captured in images. Finding repeated patterns in images, therefore, has important applications in scene understanding, 3D reconstruction, and image retrieval as well as image compression. Previous approaches in visual-pattern mining limited themselves by looking for frequently co-occurring features within a small neighborhood in an image. However, semantics of a visual pattern are typically defined by specific spatial relationships between features regardless of the spatial proximity. In this paper, semantics are represented as visual elements and geometric relationships between them. A novel unsupervised learning algorithm finds pair-wise associations of visual elements that have consistent geometric relationships sufficiently often. The algorithms are efficient - maximal matchings are determined without combinatorial search. High-order structural semantics are extracted by mining patterns that are composed of pairwise spatially consistent associations of visual elements. We demonstrate the effectiveness of our approach for discovering repeated visual patterns on a variety of image collections.
Keywords :
image matching; image reconstruction; image retrieval; semantic networks; unsupervised learning; 3D image reconstruction; efficient maximal matchings; high-order structural semantics; image collections; image compression; image retrieval; man-made objects; scene understanding; spatial proximity; unsupervised learning; visual-pattern mining; Buildings; Costs; Eyes; Image retrieval; Layout; Polynomials; Unsupervised learning; Virtual environment; Visualization; Windows;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision, 2009 IEEE 12th International Conference on
Conference_Location :
Kyoto
ISSN :
1550-5499
Print_ISBN :
978-1-4244-4420-5
Electronic_ISBN :
1550-5499
Type :
conf
DOI :
10.1109/ICCV.2009.5459465
Filename :
5459465
Link To Document :
بازگشت