مرکز منطقه ای اطلاع رساني علوم و فناوري - Sampling and Ontologically Pooling Web Images for Visual Concept Learning

DocumentCode :

1465896

Title :

Sampling and Ontologically Pooling Web Images for Visual Concept Learning

Author :

Zhu, Shiai ; Ngo, Chong-Wah ; Jiang, Yu-Gang

Author_Institution :

Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon, China

Volume :

Issue :

fYear :

2012

Firstpage :

1068

Lastpage :

1078

Abstract :

Sufficient training examples are essential for effective learning of semantic visual concepts. In practice, however, acquiring noise-free training examples has always been expensive. Recently the rapid popularization of social media websites, such as Flickr, has made it possible to collect training exemplars without human assistance. This paper proposes a novel and efficient approach to collect training samples from the noisily tagged Web images for visual concept learning, where we try to maximize two important criteria, relevancy and coverage, of the automatically generated training sets. For the former, a simple method named semantic field is introduced to handle the imprecise and incomplete image tags. Specifically, the relevancy of an image to a target concept is predicted by collectively analyzing the associated tag list of the image using two knowledge sources: WordNet corpus and statistics from Flickr.com. To boost the coverage or diversity of the training sets, we further propose an ontology-based hierarchical pooling method to collect samples not only based on the target concept alone, but also from ontologically neighboring concepts. Extensive experiments on three different datasets (NUS-WIDE, PASCAL VOC, and ImageNet) demonstrate the effectiveness of our proposed approach, producing competitive performance even when comparing with concept classifiers learned using expert- labeled training examples.

Keywords :

Internet; computer aided instruction; computer vision; ontologies (artificial intelligence); social networking (online); Flickr; WordNet corpus; computer vision; human assistance; image tags; ontologically pooling Web image sampling; semantic field; semantic visual concepts; social media Websites; visual concept learning; Animals; Manuals; Media; Semantics; Training; Training data; Visualization; Training set construction; visual concept learning; web images;

fLanguage :

English

Journal_Title :

Multimedia, IEEE Transactions on

Publisher :

ieee

ISSN :

1520-9210

Type :

jour

DOI :

10.1109/TMM.2012.2190387

Filename :

6166364

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1465896