• DocumentCode
    124136
  • Title

    Learning Concise Pattern for Interlinking with Extended Version Space

  • Author

    Fan, Zhe ; Euzenat, Jerome ; Scharffe, Francois

  • Author_Institution
    INRIA & LIG, St. Ismier, France
  • Volume
    1
  • fYear
    2014
  • fDate
    11-14 Aug. 2014
  • Firstpage
    70
  • Lastpage
    77
  • Abstract
    Many data sets on the web contain analogous data which represent the same resources in the world, so it is helpful to interlink different data sets for sharing information. However, finding correct links is very challenging because there are many instances to compare. In this paper, an interlinking method is proposed to interlink instances across different data sets. The input is class correspondences, property correspondences and a set of sample links that are assessed by users as either "positive" or "negative". We apply a machine learning method, Version Space, in order to construct a classifier, which is called interlinking pattern, that can justify correct links and incorrect links for both data sets. We improve the learning method so that it resolves the no-conjunctive-pattern problem. We call it Extended Version Space. Experiments confirm that our method with only 1% of sample links already reaches a high F-measure (around 0.96-0.99). The F-measure quickly converges, being improved by nearly 10% than other comparable approaches.
  • Keywords
    Internet; learning (artificial intelligence); pattern classification; statistical analysis; F-measure; World Wide Web; analogous data; class correspondences; classifier; data sets interlinking method; extended version space; information sharing; interlinking pattern; machine learning method; no-conjunctive-pattern problem; property correspondences; sample links; Electronic mail; Genetic programming; Ontologies; Resource description framework; Runtime; Supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Warsaw
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2014.18
  • Filename
    6927527