• DocumentCode
    2035024
  • Title
    Implementation of distributed ROCK algorithm for clustering of large categorical datasets and its performance analysis
  • Author
    Patidar, Anil ; Joshi, Ritesh ; Mishra, Surendra
  • Author_Institution
    MCA, MITM, Indore, India
  • Volume
    2
  • fYear
    2011
  • fDate
    8-10 April 2011
  • Firstpage
    79
  • Lastpage
    83
  • Abstract
    Clustering in data mining is useful for discovering distribution patterns in the underlying data. ROCK is one such hierarchical clustering algorithm, which works on sampled data. We show that the sequential ROCK algorithm is time-consuming for large datasets. Instead, we present distributed algorithms with better performance than known algorithms. We develop a distributed version of the robust hierarchical clustering algorithm ROCK (DROCK) in which the preliminary calculations are done on different processors. In addition to presenting detailed complexity results for DROCK, we also conduct an experimental study with real-life data sets to demonstrate the effectiveness of our technique.
  • Keywords
    data mining; distributed algorithms; pattern clustering; distributed ROCK algorithm; distribution pattern discovery; large categorical dataset clustering; performance analysis; robust hierarchical clustering algorithm; sequential ROCK algorithm; Algorithm design and analysis; Clustering algorithms; Engines; Program processors; Robustness; Rocks; Categorical Dataset; Clustering; Distributed Computing; ROCK
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Electronics Computer Technology (ICECT), 2011 3rd International Conference on
  • Conference_Location
    Kanyakumari
  • Print_ISBN
    978-1-4244-8678-6
  • Electronic_ISBN
    978-1-4244-8679-3
  • Type
    conf
  • DOI
    10.1109/ICECTECH.2011.5941659
  • Filename
    5941659