Title :
Efficient data clustering over peer-to-peer networks
Author :
Elgohary, Ahmed ; Ismail, Mohamed A.
Author_Institution :
Comput. & Syst. Eng. Dept., Alexandria Univ., Alexandria, Egypt
Abstract :
Due to the dramatic increase of data volumes in different applications, it is becoming infeasible to keep these data in one centralized machine. It is becoming more and more natural to deal with distributed databases and networks. That is why distributed data mining techniques have been introduced. One of the most important data mining problems is data clustering. While many clustering algorithms exist for centralized databases, there is a lack of efficient algorithms for distributed databases. In this paper, an efficient algorithm is proposed for clustering distributed databases. The proposed methodology employs an iterative optimization technique to achieve better clustering objective. The experimental results reported in this paper show the superiority of the proposed technique over a recently proposed algorithm based on a distributed version of the well known K-Means algorithm (Datta et al. 2009) [1].
Keywords :
data mining; distributed databases; iterative methods; optimisation; pattern clustering; peer-to-peer computing; K-means algorithm; centralized databases; data clustering; distributed data mining; distributed databases; iterative optimization; peer-to-peer networks; Algorithm design and analysis; Clustering algorithms; Data mining; Distributed databases; Heuristic algorithms; Peer to peer computing; Synchronization; Clustering Over P2P Networks; Distributed Data Mining; Distributed K-Means Clustering; Iterative Optimization; Minimum Variance Clustering;
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
Conference_Location :
Cordoba
Print_ISBN :
978-1-4577-1676-8
DOI :
10.1109/ISDA.2011.6121656