DocumentCode
2352848
Title
Clustering art
Author
Barnard, Kobus ; Duygulu, Pinar ; Forsyth, David
Author_Institution
Comput. Div., California Univ., Berkeley, CA, USA
Volume
2
fYear
2001
fDate
2001
Abstract
We extend a recently developed method (K. Barnard and D. Forsyth, 2001) for learning the semantics of image databases using text and pictures. We incorporate statistical natural language processing to deal with free text. We demonstrate the current system on a difficult dataset, namely 10,000 images of work from the Fine Arts Museum of San Francisco. The images include line drawings, paintings, and pictures of sculpture and ceramics. Many of the images have associated free text, which varies greatly, from physical description to interpretation and mood. We use WordNet to provide semantic grouping information, to help disambiguate word senses, and to emphasize the hierarchical nature of semantic relationships. This allows us to impose a natural structure on the image collection that reflects semantics to a considerable degree. Our method produces a joint probability distribution for words and picture elements. We demonstrate that this distribution can be used (a) to provide illustrations for given captions, and (b) to generate words for images outside the training set. Results from this annotation process yield a quantitative study of our method. Finally, the annotation process can be seen as a form of object recognizer that has been learned through a partially supervised process.
Keywords
art; computational linguistics; document image processing; natural languages; text analysis; visual databases; Fine Arts Museum; WordNet; annotation process; art clustering; ceramics; dataset; free text; hierarchical nature; image collection; image databases; joint probability distribution; line drawings; natural structure; object recognizer; partially supervised process; quantitative study; sculpture; semantic grouping information; semantic relationships; statistical natural language processing; training set; word sense disambiguation; Art; Ceramics; Image databases; Image generation; Mood; Natural language processing; Painting; Predictive models; Probability distribution; Subspace constraints;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Vision and Pattern Recognition, 2001. CVPR 2001. Proceedings of the 2001 IEEE Computer Society Conference on
ISSN
1063-6919
Print_ISBN
0-7695-1272-0
Type
conf
DOI
10.1109/CVPR.2001.990994
Filename
990994