مرکز منطقه ای اطلاع رساني علوم و فناوري - Bridging the Semantic Gap Between Image Contents and Tags

DocumentCode :

1504125

Title :

Bridging the Semantic Gap Between Image Contents and Tags

Author :

Ma, Hao ; Zhu, Jianke ; Lyu, Michael Rung-Tsong ; King, Irwin

Author_Institution :

Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, Kowloon, China

Volume :

Issue :

fYear :

2010

Firstpage :

462

Lastpage :

473

Abstract :

With the exponential growth of Web 2.0 applications, tags have been used extensively to describe the image contents on the Web. Due to the noisy and sparse nature in the human generated tags, how to understand and utilize these tags for image retrieval tasks has become an emerging research direction. As the low-level visual features can provide fruitful information, they are employed to improve the image retrieval results. However, it is challenging to bridge the semantic gap between image contents and tags. To attack this critical problem, we propose a unified framework in this paper which stems from a two-level data fusions between the image contents and tags: 1) A unified graph is built to fuse the visual feature-based image similarity graph with the image-tag bipartite graph; 2) A novel random walk model is then proposed, which utilizes a fusion parameter to balance the influences between the image contents and tags. Furthermore, the presented framework not only can naturally incorporate the pseudo relevance feedback process, but also it can be directly applied to applications such as content-based image retrieval, text-based image retrieval, and image annotation. Experimental analysis on a large Flickr dataset shows the effectiveness and efficiency of our proposed framework.

Keywords :

Internet; content-based retrieval; graph theory; image fusion; image retrieval; Flickr dataset; Web 2.0 applications; data fusions; image contents; image retrieval; image-tag bipartite graph; novel random walk model; pseudo relevance feedback process; visual feature-based image similarity graph; Content-based image retrieval; image annotation; random walk; text-based image retrieval;

fLanguage :

English

Journal_Title :

Multimedia, IEEE Transactions on

Publisher :

ieee

ISSN :

1520-9210

Type :

jour

DOI :

10.1109/TMM.2010.2051360

Filename :

5473143

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1504125