DocumentCode :
3333114
Title :
Automatic indexing and content-based retrieval of captioned photographs
Author :
Srihari, Rohini K.
Author_Institution :
Center of Excellence for Document Anal. & Recognition, State Univ. of New York, Buffalo, NY, USA
Volume :
2
fYear :
1995
fDate :
14-16 Aug 1995
Firstpage :
1165
Abstract :
This research explores the interaction of textual and photographic information in an integrated text/image database environment. Specifically, We present a content-based retrieval system for captioned group photographs of people (i.e., human faces) where groups can consist of one or more members. By understanding the caption accompanying a picture, we are able to extract information useful in (i) retrieving the picture and (ii) directing an image interpretation system identify relevant objects (in this case, faces) in the picture. For the latter, we incorporate techniques from our ongoing research on photo understanding using accompanying text. Current image-based techniques have limitations; for example, similarity techniques used for retrieving faces will not perform well in group photographs where the locations of faces is not known a priori or where face sizes are small. By exploiting caption information, we assist a face locator in detecting human faces in a photograph and subsequently labelling them. Text-based similarity algorithms have principally relied on statistical techniques to index and classify documents (e.g., vector models). It is necessary to employ natural language processing techniques in order to derive deeper semantics from captions which contain far fewer words than documents. Our approach is unique since it goes beyond a superficial combination of existing text-based and image-based approaches to information retrieval
Keywords :
face recognition; indexing; natural languages; query processing; visual databases; automatic indexing; captioned photographs; content-based retrieval; face locator; human faces; information retrieval; natural language processing; text/image database; Content based retrieval; Data mining; Face detection; Humans; Image databases; Image retrieval; Information retrieval; Labeling; Machine assisted indexing; Natural language processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
Type :
conf
DOI :
10.1109/ICDAR.1995.602129
Filename :
602129
Link To Document :
بازگشت