DocumentCode :
2208386
Title :
Multi-dimensional Mass Estimation and Mass-based Clustering
Author :
Kai Ming Ting ; Wells, J.R.
Author_Institution :
Gippsland Sch. of Inf. Technol., Monash Univ., Clayton, VIC, Australia
fYear :
2010
fDate :
13-17 Dec. 2010
Firstpage :
511
Lastpage :
520
Abstract :
Mass estimation, an alternative to density estimation, has been shown recently to be an effective base modelling mechanism for three data mining tasks of regression, information retrieval and anomaly detection. This paper advances this work in two directions. First, we generalise the previously proposed one-dimensional mass estimation to multidimensional mass estimation, and significantly reduce the time complexity to O(ψh) from O(ψh)-making it feasible for a full range of generic problems. Second, we introduce the first clustering method based on mass-it is unique because it does not employ any distance or density measure. The structure of the new mass model enables different parts of a cluster to be identified and merged without expensive evaluations. The characteristics of the new clustering method are: (i) it can identify arbitrary-shape clusters; (ii) it is significantly faster than existing density-based or distance-based methods; and (iii) it is noise-tolerant.
Keywords :
computational complexity; data mining; information retrieval; pattern clustering; anomaly detection; arbitrary-shape cluster; base modelling mechanism; data mining task; density estimation; density-based method; distance-based method; information retrieval; mass-based clustering; multidimensional mass estimation; one dimensional mass estimation; time complexity; Clustering methods; Complexity theory; Data mining; Data models; Equations; Estimation; Runtime; Mass estimation; mass-based clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-4786
Print_ISBN :
978-1-4244-9131-5
Type :
conf
DOI :
10.1109/ICDM.2010.49
Filename :
5694005
Link To Document :
بازگشت