Title :
Clustering XML Documents Based on the Weight of Frequent Structures
Author :
Hwang, Jeong Hee ; Gu, Mi Sug
Author_Institution :
Namseoul Univ., Chonan
Abstract :
Previous clustering methods for XML documents group documents with similar structures by measuring the structural similarity and distance between them. In this paper, we instead propose a novel clustering method that uses the weight of frequent structures in XML documents, treating each XML document as a transaction and the structures extracted from it as the items of that transaction. Our experimental results demonstrate the high speed and cluster cohesion of the method.
Keywords :
XML; document handling; XML documents; clustering methods; structural similarity; Bioinformatics; Books; Clustering algorithms; Computer science; Databases; Information technology; Internet; Laboratories
Conference_Title :
Convergence Information Technology, 2007. International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
0-7695-3038-9
DOI :
10.1109/ICCIT.2007.101