DocumentCode
3100786
Title
Web Mining of Relations from XML and Construct Database Schema
Author
Zhou, Xu ; Pan, Xuezeng ; Ren, Yu
Author_Institution
Coll. of Comput. Sci., Zhejiang Univ., Hangzhou
fYear
2006
fDate
Nov. 28 2006-Dec. 1 2006
Firstpage
211
Lastpage
211
Abstract
Increasing amount of commercial data is presented in XML format for exchanging or publishing on the Web. It is emerging as a new standard for information representation and exchanging over the Internet. How to retrieve valuable information from XML documents on the Web is a new challenge to data mining research. Compared with relational database, XML data in documents is stored as file with tree logical structure inside, it results in lower efficiency and performance in directly querying data. So it is still necessary to transform data into database (warehouse) for data mining afterwards. In this paper, we present a scheme to analyze relation of elements in XML on the Web, and construct relational database schema based on the analysis. During the process, there would be a worthy accessory product - a glossary, which can facilitate the process of data mining warehouse designing and building.
Keywords
Internet; XML; data mining; relational databases; Internet; Web mining; World Wide Web; XML data; XML documents; XML format; construct database schema; data mining warehouse; data querying; eXtensible Markup Language; information representation; information retrieval; relational database schema; tree logical structure; Buildings; Data mining; Information representation; Information retrieval; Internet; Publishing; Relational databases; Terminology; Web mining; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence for Modelling, Control and Automation, 2006 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
0-7695-2731-0
Type
conf
DOI
10.1109/CIMCA.2006.233
Filename
4052827
Link To Document