Title :
Using Swarm intelligence for XML Clustering
Author :
Wang, Tong ; Liu, Daxin ; Lin, Xuanzuo ; Sun, Xiaohua
Author_Institution :
Harbin Eng. Univ.
Abstract :
Data mining in large-scale XML documents set can facilitate to query and manage XML documents. This paper proposes a novel XML document clustering method based on swarm intelligence. Firstly, the approach extracts path sequences from documents, and then the documents are transformed to vectors in a high-dimensional Euclidean space. Finally, CSX clustering method is applied to with high performance. The advantage of the approach is that swarm intelligence can help skip out of the local optima of the search space. Data sets are obtained from DBLP, and the experiment results show that the performance of the proposed techniques outperformed the standard C-means method in clustering compact and accuracy
Keywords :
XML; data mining; optimisation; pattern clustering; search problems; CSX clustering method; DBLP; XML clustering; c-means method; data mining; high-dimensional Euclidean space; large-scale XML documents; path sequences extraction; swarm intelligence; Agricultural engineering; Agriculture; Books; Clustering methods; Data mining; Electronic mail; Large-scale systems; Particle swarm optimization; Sun; XML; XML; clustering; data mining; swarm intelligence;
Conference_Titel :
Intelligent Control and Automation, 2006. WCICA 2006. The Sixth World Congress on
Conference_Location :
Dalian
Print_ISBN :
1-4244-0332-4
DOI :
10.1109/WCICA.2006.1714231