Title :
Ontology guided autonomous label assignment in wrapper induced tables with missing column names
Author :
Amin, Mohammad Shafkat ; Jamil, Hasan
Author_Institution :
Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
Abstract :
Formulating and executing queries over distributed, autonomous and heterogeneous resources is an important research area. The advent of the Internet and the Web and their inherent ubiquity have brought forth opportunities to query these information sources in an automated and independent manner. In the domain of information extraction, automatic wrapper generation has been well studied but the efficacy of the current wrappers are limited by the fact that automatic annotation of column names to the extracted tabular data is yet to be perfected. In this paper, we propose a novel annotation system that can assign meaningful column names to the extracted tables for subsequent queries. We enhance our prototype wrapper system FastWrap with this annotator to support fast and autonomous on-the-fly data integration and ad hoc declarative querying.
Keywords :
ontologies (artificial intelligence); query formulation; Internet; ad hoc declarative querying; annotation system; automatic wrapper generation; autonomous label assignment; data integration; information extraction; missing column names; ontology; query formulation; wrapper system FastWrap; wrapper-induced tables; Computer science; Data mining; Databases; HTML; Humans; Internet; Ontologies; Prototypes; Search engines; Web page design;
Conference_Titel :
Information Reuse & Integration, 2009. IRI '09. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-4114-3
Electronic_ISBN :
978-1-4244-4116-7
DOI :
10.1109/IRI.2009.5211591