• DocumentCode
    3369848
  • Title

    Coercion: A Distributed Clustering Algorithm for Categorical Data

  • Author

    Bin Wang ; Yang Zhou ; Xinhong Hei

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Xi´an Univ. of Technol., Xi´an, China
  • fYear
    2013
  • fDate
    14-15 Dec. 2013
  • Firstpage
    683
  • Lastpage
    687
  • Abstract
    Clustering is an important technology in data mining. Squeezer is one such clustering algorithm for categorical data and it is more efficient than most existing algorithms for categorical data. But Squeezer is time consuming for very large datasets which are distributed in different servers. Thus, we employ the distributed thinking to improve Squeezer and a distributed algorithm for categorical data called Coercion is proposed in this paper. In order to present detailed complexity results for Coercion, we also conduct an experimental study with standard as well as synthetic data sets to demonstrate the effectiveness of the new algorithm.
  • Keywords
    data mining; pattern clustering; Coercion; Squeezer; categorical data; data mining; distributed clustering algorithm; Algorithm design and analysis; Clustering algorithms; Distributed databases; Partitioning algorithms; Presses; Rocks; Servers; Sqeezer; categorical data; clustering; data mining; distributed;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Security (CIS), 2013 9th International Conference on
  • Conference_Location
    Leshan
  • Print_ISBN
    978-1-4799-2548-3
  • Type

    conf

  • DOI
    10.1109/CIS.2013.149
  • Filename
    6746517