DocumentCode :
1559478
Title :
Finding and labeling the subject of a captioned depictive natural photograph
Author :
Rowe, Neil C.
Author_Institution :
US Naval Postgraduate Sch., Monterey, CA, USA
Volume :
14
Issue :
1
fYear :
2002
Firstpage :
202
Lastpage :
207
Abstract :
We address the problem of finding the subject of a photographic image intended to illustrate some physical object or objects ("depictive") and taken by usual optical means without magnification ("natural"). This could help in developing digital image libraries, since important image properties like subject size and color are not usually mentioned in accompanying captions, yet can help rank the photographs retrieved for a user. We explore an approach that identifies the "visual focus" of the image and the "depicted concepts" in a caption and connects them. The visual focus is determined by using eight domain-independent characteristics of regions in the segmented image, and the caption depiction is identified by a set of rules applied to the parsed and interpreted caption. The visual-focus determination also performs combinatorial optimization on sets of regions to find the set that best satisfies the focus criteria. Experiments on 100 randomly selected image-caption pairs show significant improvement in precision of retrieval over simpler methods and, in particular, emphasize the value of segmenting the image.
Keywords :
database indexing; image retrieval; natural language interfaces; visual databases; caption; digital image libraries; information retrieval; multimedia; natural-language understanding; photograph; photographic image; visual focus; Labeling;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.979983
Filename :
979983