DocumentCode :
1448688
Title :
Constructing Concept Lexica With Small Semantic Gaps
Author :
Lu, Yijuan ; Zhang, Lei ; Liu, Jiemin ; Tian, Qi
Author_Institution :
Dept. of Comput. Sci., Texas State Univ., San Marcos, TX, USA
Volume :
12
Issue :
4
fYear :
2010
fDate :
6/1/2010 12:00:00 AM
Firstpage :
288
Lastpage :
299
Abstract :
In recent years, constructing mathematical models for visual concepts by using content features, i.e., color, texture, shape, or local features, has led to the fast development of concept-based multimedia retrieval. In concept-based multimedia retrieval, defining a good lexicon of high-level concepts is the first and important step. However, which concepts should be used for data collection and model construction is still an open question. People agree that concepts that can be easily described by low-level visual features can construct a good lexicon. These concepts are called concepts with small semantic gaps. Unfortunately, there is very little research found on semantic gap analysis and on automatically choosing multimedia concepts with small semantic gaps, even though differences of semantic gaps among concepts are well worth investigating. In this paper, we propose a method to quantitatively analyze semantic gaps and develop a novel framework to identify high-level concepts with small semantic gaps from a large-scale web image dataset. Images with small semantic gaps are selected and clustered first by defining a confidence score and a content-context similarity matrix in visual space and textual space. Then, from the surrounding descriptions (titles, categories, and comments) of these images, concepts with small semantic gaps are automatically mined. In addition, considering that semantic gap analysis depends on both features and content-contextual consistency, we construct a lexicon family of high-level concepts with small semantic gaps (LCSS) based on different low-level features and different consistency measurements. This set of lexica is both independent to each other and mutually complimentary. LCSS is very helpful for data collection, feature selection, annotation, and modeling for large-scale image retrieval. It also shows a promising application potential for image annotation refinement and rejection. The experimental results demonstrate the validity- - of the developed concept lexica.
Keywords :
image retrieval; multimedia computing; concept lexica; concept-based multimedia retrieval; content features; content-context similarity matrix; content-contextual consistency; data collection; feature selection; high-level concepts; image annotation refinement; large-scale Web image dataset; large-scale image retrieval; low level visual features; mathematical models; model construction; multimedia concepts; semantic gap analysis; small semantic gaps; textual space; visual concepts; visual space; Image retrieval; large-scale; lexica; semantic gap;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2010.2046292
Filename :
5437225
Link To Document :
بازگشت