DocumentCode
2708060
Title
State of the art in metadata abstraction crawlers
Author
Dong, Hai ; Hussain, Farookh Khadeer ; Chang, Elizabeth
Author_Institution
Digital Ecosyst. & Bus. Intell. Inst., Curtin Univ. of Technol., Perth, WA
fYear
2008
fDate
21-24 April 2008
Firstpage
1
Lastpage
6
Abstract
Crawler research is moving closer to the Semantic Web as XML/RDF/OWL files become more common and ontology mark-up languages develop rapidly. Metadata abstraction crawlers are an emerging class of crawlers that aim to abstract metadata from ordinary HTML documents using various Semantic Web technologies. In this paper, we survey the current state of metadata abstraction crawlers. Fourteen cases in this field are chosen as typical examples and classified into five clusters. We compare and contrast the Semantic Web crawlers in each cluster from seven perspectives, and draw our conclusions in the final section.
Keywords
data structures; meta data; semantic Web; OWL files; RDF; XML; metadata abstraction crawlers; ontology mark-up languages; semantic Web technologies; Australia; Crawlers; Ecosystems; HTML; OWL; Ontologies; Organizing; Resource description framework; Semantic Web; XML; OAI-PMH; RDF crawlers; focused crawlers; metadata abstraction; semantic web crawlers;
fLanguage
English
Publisher
IEEE
Conference_Titel
Industrial Technology, 2008. ICIT 2008. IEEE International Conference on
Conference_Location
Chengdu
Print_ISBN
978-1-4244-1705-6
Electronic_ISBN
978-1-4244-1706-3
Type
conf
DOI
10.1109/ICIT.2008.4608573
Filename
4608573