DocumentCode :
514810
Title :
A Cluster Description Method for High Dimensional Data Clustering with Categorical Variables
Author :
Wu, Sen ; Gu, Shujuan
Author_Institution :
Sch. of Econ. & Manage., Univ. of Sci. & Technol. Beijing, Beijing, China
Volume :
1
fYear :
2010
fDate :
13-14 March 2010
Firstpage :
32
Lastpage :
35
Abstract :
High dimensional data clustering is always of great difficulty in clustering research. Before the clustering process is accomplished, the partition of the objects is unknown. Therefore after the clustering process, the results of the final clusters should be presented understandably, which will be strictly difficult when it comes to high dimensionality. This paper presents a cluster description schema for high dimensional data clustering with categorical variables. The description schema presented in this paper uses supremum and infimum to represent the clusters concisely and based on the schema a new method is given to assign the non-sample objects to clusters obtained from sample space. The distribution process requires one-time scan of dataset, updates the description of clusters dynamically, and can detect the isolated objects. Experiments on both synthetic and real data show its effectiveness and scalability.
Keywords :
data mining; pattern clustering; categorical variables; cluster description method; data mining; high dimensional data clustering; Automation; Clustering algorithms; Conference management; Data mining; Mechatronics; Nearest neighbor searches; Object detection; Sampling methods; Scalability; Technology management; Categorical Variables; Clustering; Data Mining; High Dimensional Space; KDD;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Measuring Technology and Mechatronics Automation (ICMTMA), 2010 International Conference on
Conference_Location :
Changsha City
Print_ISBN :
978-1-4244-5001-5
Electronic_ISBN :
978-1-4244-5739-7
Type :
conf
DOI :
10.1109/ICMTMA.2010.147
Filename :
5459107
Link To Document :
بازگشت