• DocumentCode
    2708060
  • Title

    State of the art in metadata abstraction crawlers

  • Author

    Dong, Hai ; Hussain, Farookh Khadeer ; Chang, Elizabeth

  • Author_Institution
    Digital Ecosyst. & Bus. Intell. Inst., Curtin Univ. of Technol., Perth, WA
  • fYear
    2008
  • fDate
    21-24 April 2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Nowadays, the research of crawlers moves closer to the semantic web, along with the appearance of increasing XML/RDF/OWL files and the rapid development of ontology mark-up languages. As an emerging concept, metadata abstraction crawlers are a series of crawlers that aim to abstract metadata from normal HTML documents, based on various semantic Web technologies. In this paper, we make a general survey of the current situation of metadata abstraction crawlers. Fourteen cases in this field are chosen as typical examples, and classified in five clusters. From seven perspectives we horizontally compare and contrast the semantic Web crawlers in each cluster, and draw our conclusion in the final section.
  • Keywords
    data structures; meta data; semantic Web; OWL files; RDF; XML; metadata abstraction crawlers; ontology mark-up languages; semantic Web technologies; Australia; Crawlers; Ecosystems; HTML; OWL; Ontologies; Organizing; Resource description framework; Semantic Web; XML; OAI-PMH; RDF crawlers; focused crawlers; metadata abstraction; semantic web crawlers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Industrial Technology, 2008. ICIT 2008. IEEE International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4244-1705-6
  • Electronic_ISBN
    978-1-4244-1706-3
  • Type

    conf

  • DOI
    10.1109/ICIT.2008.4608573
  • Filename
    4608573