DocumentCode
2035024
Title
Implementation of distributed ROCK algorithm for clustering of large categorical datasets and its performance analysis
Author
Patidar, Anil ; Joshi, Ritesh ; Mishra, Surendra
Author_Institution
MCA, MITM, Indore, India
Volume
2
fYear
2011
fDate
8-10 April 2011
Firstpage
79
Lastpage
83
Abstract
Clustering in data mining, is useful to discover distribution patterns in the underlying data. ROCK is one such hierarchical clustering algorithm, which works on sampled data. We show that sequential ROCK algorithm is time consuming for large dataset. Instead, we present distributed algorithms with better performance than known algorithms. We develop a robust hierarchical clustering algorithm ROCK that employs preliminary calculations to be done at different processors. In addition to presenting detailed complexity results for DROCK we also conduct an experimental study with real life data sets to demonstrate the effectiveness of our technique.
Keywords
data mining; distributed algorithms; pattern clustering; data mining; distributed ROCK algorithm; distribution pattern discovery; large categorical dataset clustering; performance analysis; robust hierarchical clustering algorithm; sequential ROCK algorithm; Algorithm design and analysis; Clustering algorithms; Data mining; Engines; Program processors; Robustness; Rocks; Categorical Dataset; Clustering; Distributed Computing; ROCK;
fLanguage
English
Publisher
ieee
Conference_Titel
Electronics Computer Technology (ICECT), 2011 3rd International Conference on
Conference_Location
Kanyakumari
Print_ISBN
978-1-4244-8678-6
Electronic_ISBN
978-1-4244-8679-3
Type
conf
DOI
10.1109/ICECTECH.2011.5941659
Filename
5941659
Link To Document