DocumentCode :
3407167
Title :
An automated integration approach for semi-structured and structured data
Author :
Lim, Seung-Jin ; Ng, Yiu-Kai
Author_Institution :
Dept. of Comput. Sci., Brigham Young Univ., Provo, UT, USA
fYear :
2001
fDate :
2001
Firstpage :
12
Lastpage :
21
Abstract :
As data access beyond the traditional intranet boundary is popular on the Internet these days, the demand for an integrated and uniform method for accessing Web data sources that are different in structures and semantics is increasing. This demand is partly driven by users who want to access more diverse information, such as up-to-date information on stock market, entertainment, news, and science. The demand is also partly driven by information providers who provide information service to customers on the Web. The authors present an approach to integrate semi-structured data sources and structured data sources by using an automated structure resolution approach. The structure resolution approach can easily be adopted to i) integrate existing relations in the relational database model into semi-structured data sources, and ii) merge sets of semi-structured data that have different structures with no human intervention. The integration of multiple data sources by using our approach results in the unified view (UV) of the data sources, which is presented in an XML DTD format. UV can be used for query optimization on heterogeneous data sources
Keywords :
data structures; distributed databases; hypermedia markup languages; information resources; information retrieval; merging; relational databases; UV; Web data source access; XML DTD format; automated integration approach; automated structure resolution approach; diverse information; entertainment; heterogeneous data sources; information providers; information service; multiple data sources; news; query optimization; relational database model; science; semi-structured data sources; stock market; structured data sources; unified view; up-to-date information; Computer science; Data analysis; Data models; Data warehouses; Humans; Internet; Query processing; Relational databases; Stock markets; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cooperative Database Systems for Advanced Applications, 2001. CODAS 2001. The Proceedings of the Third International Symposium on
Conference_Location :
Beijing
Print_ISBN :
0-7695-1128-7
Type :
conf
DOI :
10.1109/CODAS.2001.945144
Filename :
945144
Link To Document :
بازگشت