Title :
Study and application of web data mining based on XML
Author :
Zhang, Pengwei ; Chen, Jingxia
Author_Institution :
Electr. & Inf. Eng. Coll., Shaanxi Univ. of Sci. & Technol., Xi'an, China
Abstract :
With the development of information technologies, web data mining has been proposed and widely researched. It is defined as the discovery, extraction, and analysis of potentially useful information from the World Wide Web. However, much of the data in web pages is heterogeneous, irregular, dynamically updated, and semi-structured, which makes web data mining difficult. To address this problem, the paper analyzes the characteristics of XML, presents an XML-based web data mining model, and describes in detail, through a worked instance, how to implement the model with XML and Java technologies. Finally, it discusses the shortcomings of the model.
Keywords :
Data engineering; Data mining; Databases; Educational institutions; Educational technology; HTML; Information technology; Java; Paper technology; XML; RDF; Web; semi-structured;
Conference_Title :
Educational and Network Technology (ICENT), 2010 International Conference on
Conference_Location :
Qinhuangdao, China
Print_ISBN :
978-1-4244-7660-2
Electronic_ISBN :
978-1-4244-7662-6
DOI :
10.1109/ICENT.2010.5532169