DocumentCode :
124136
Title :
Learning Concise Pattern for Interlinking with Extended Version Space
Author :
Fan, Zhe ; Euzenat, Jerome ; Scharffe, Francois
Author_Institution :
INRIA & LIG, St. Ismier, France
Volume :
1
fYear :
2014
fDate :
11-14 Aug. 2014
Firstpage :
70
Lastpage :
77
Abstract :
Many data sets on the web contain analogous data which represent the same resources in the world, so it is helpful to interlink different data sets for sharing information. However, finding correct links is very challenging because there are many instances to compare. In this paper, an interlinking method is proposed to interlink instances across different data sets. The input is class correspondences, property correspondences and a set of sample links that are assessed by users as either "positive" or "negative". We apply a machine learning method, Version Space, in order to construct a classifier, which is called interlinking pattern, that can justify correct links and incorrect links for both data sets. We improve the learning method so that it resolves the no-conjunctive-pattern problem. We call it Extended Version Space. Experiments confirm that our method with only 1% of sample links already reaches a high F-measure (around 0.96-0.99). The F-measure quickly converges, being improved by nearly 10% than other comparable approaches.
Keywords :
Internet; learning (artificial intelligence); pattern classification; statistical analysis; F-measure; World Wide Web; analogous data; class correspondences; classifier; data sets interlinking method; extended version space; information sharing; interlinking pattern; machine learning method; no-conjunctive-pattern problem; property correspondences; sample links; Electronic mail; Genetic programming; Ontologies; Resource description framework; Runtime; Supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Warsaw
Type :
conf
DOI :
10.1109/WI-IAT.2014.18
Filename :
6927527
Link To Document :
بازگشت