Title :
Image Interpretation Using Large Corpus: Wikipedia
Author :
Rahurkar, Mandar ; Tsai, Shen-Fu ; Dagli, Charlie ; Huang, Thomas S.
Author_Institution :
Beckman Inst., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Abstract :
Image is a powerful medium for expressing one´s ideas and rightly confirms the adage, “One picture is worth a thousand words.” In this work, we explore the application of world knowledge in the form of Wikipedia to achieve this objective-literally. In the first part, we disambiguate and rank semantic concepts associated with ambiguous keywords by exploiting link structure of articles in Wikipedia. In the second part, we explore an image representation in terms of keywords which reflect the semantic content of an image. Our approach is inspired by the desire to augment low-level image representation with massive amounts of world knowledge, to facilitate computer vision tasks like image retrieval based on this information. We represent an image as a weighted mixture of a predetermined set of concrete concepts whose definition has been agreed upon by a wide variety of audience. To achieve this objective, we use concepts defined by Wikipedia articles, e.g., sky, building, or automobile. An important advantage of our approach is availability of vast amounts of highly organized human knowledge in Wikipedia. Wikipedia evolves rapidly steadily increasing its breadth and depth over time.
Keywords :
computer vision; image representation; image retrieval; semantic Web; Wikipedia; computer vision; image interpretation; image representation; image retrieval; rank semantic concepts; Automobiles; Computer vision; Concrete; Encyclopedias; Humans; Image representation; Image retrieval; Information retrieval; Search engines; Visual system; Wikipedia; Concepts; Wikipedia; image understanding;
Journal_Title :
Proceedings of the IEEE
DOI :
10.1109/JPROC.2010.2050410