DocumentCode
3369848
Title
Coercion: A Distributed Clustering Algorithm for Categorical Data
Author
Bin Wang ; Yang Zhou ; Xinhong Hei
Author_Institution
Sch. of Comput. Sci. & Eng., Xi'an Univ. of Technol., Xi'an, China
fYear
2013
fDate
14-15 Dec. 2013
Firstpage
683
Lastpage
687
Abstract
Clustering is an important technique in data mining. Squeezer is a clustering algorithm for categorical data that is more efficient than most existing algorithms for such data. However, Squeezer is time-consuming on very large datasets that are distributed across different servers. We therefore apply a distributed approach to improve Squeezer and propose Coercion, a distributed clustering algorithm for categorical data. In addition to presenting detailed complexity results for Coercion, we conduct an experimental study on standard as well as synthetic datasets to demonstrate the effectiveness of the new algorithm.
Keywords
data mining; pattern clustering; Coercion; Squeezer; categorical data; distributed clustering algorithm; Algorithm design and analysis; Clustering algorithms; Distributed databases; Partitioning algorithms; Presses; Rocks; Servers; clustering; distributed;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computational Intelligence and Security (CIS), 2013 9th International Conference on
Conference_Location
Leshan
Print_ISBN
978-1-4799-2548-3
Type
conf
DOI
10.1109/CIS.2013.149
Filename
6746517