DocumentCode :
3079400
Title :
Ontology guided autonomous label assignment in wrapper induced tables with missing column names
Author :
Amin, Mohammad Shafkat ; Jamil, Hasan
Author_Institution :
Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
fYear :
2009
fDate :
10-12 Aug. 2009
Firstpage :
424
Lastpage :
425
Abstract :
Formulating and executing queries over distributed, autonomous and heterogeneous resources is an important research area. The advent of the Internet and the Web and their inherent ubiquity have brought forth opportunities to query these information sources in an automated and independent manner. In the domain of information extraction, automatic wrapper generation has been well studied but the efficacy of the current wrappers are limited by the fact that automatic annotation of column names to the extracted tabular data is yet to be perfected. In this paper, we propose a novel annotation system that can assign meaningful column names to the extracted tables for subsequent queries. We enhance our prototype wrapper system FastWrap with this annotator to support fast and autonomous on-the-fly data integration and ad hoc declarative querying.
Keywords :
ontologies (artificial intelligence); query formulation; Internet; ad hoc declarative querying; annotation system; automatic wrapper generation; autonomous label assignment; data integration; information extraction; missing column names; ontology; query formulation; wrapper system FastWrap; wrapper-induced tables; Computer science; Data mining; Databases; HTML; Humans; Internet; Ontologies; Prototypes; Search engines; Web page design;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse & Integration, 2009. IRI '09. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-4114-3
Electronic_ISBN :
978-1-4244-4116-7
Type :
conf
DOI :
10.1109/IRI.2009.5211591
Filename :
5211591
Link To Document :
بازگشت