• DocumentCode
    1787453
  • Title

    CrowdLink: Crowdsourcing for Large-Scale Linked Data Management

  • Author

    Basharat, Arslan ; Arpinar, I. Budak ; Dastgheib, Shima ; Kursuncu, Ugur ; Kochut, Krys ; Dogdu, Erdogan

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Georgia, Athens, GA, USA
  • fYear
    2014
  • fDate
    16-18 June 2014
  • Firstpage
    227
  • Lastpage
    234
  • Abstract
    Crowd sourcing is an emerging paradigm to exploit the notion of human-computation for solving various computational problems, which cannot be accurately solved solely by the machine-based solutions. We use crowd sourcing for large-scale link management in the Semantic Web. More specifically, we develop Crowd Link, which utilizes crowd workers for verification and creation of triples in Linking Open Data (LOD). LOD incorporates the core data sets in the Semantic Web, yet is not in full conformance with the guidelines for publishing high quality linked data on the Web. Our approach can help in enriching and improving quality of mission-critical links in LOD. Scalable LOD link management requires a hybrid approach, where human intelligent and machine intelligent tasks interleave in a workflow execution. Likewise, many other crowd sourcing applications require a sophisticated workflow specification not only on human intelligent tasks, but also machine intelligent tasks to handle data and control-flow, which is strictly deficient in the existing crowd sourcing platforms. Hence, we are strongly motivated to investigate the interplay of crowd sourcing, and semantically enriched workflows for better human-machine cooperation in task completion. We demonstrate usefulness of our approach through various link creation and verification tasks, and workflows using Amazon Mechanical Turk. Experimental evaluation demonstrates promising results in terms of accuracy of the links created, and verified by the crowd workers.
  • Keywords
    data handling; information retrieval; semantic Web; Amazon Mechanical Turk; CrowdLink; LOD; crowd sourcing; crowd workers; human intelligent task; human-computation notion; large-scale linked data management; link creation task; link verification task; linking open data; machine intelligent task; machine-based solutions; semantic Web; workflow execution; workflow specification; Accuracy; Crowdsourcing; Engines; Generators; Man machine systems; Ontologies; Semantics; Crowdsourcing; Human Intelligent Task; Linking Open Data (LOD); Ontology Verification and Entity Disambiguation; Semantic Web; Workflow Management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing (ICSC), 2014 IEEE International Conference on
  • Conference_Location
    Newport Beach, CA
  • Print_ISBN
    978-1-4799-4002-8
  • Type

    conf

  • DOI
    10.1109/ICSC.2014.14
  • Filename
    6882027