DocumentCode :
2866277
Title :
CLUMP: a scalable and robust framework for structure discovery
Author :
Punera, Kunal ; Ghosh, Joydeep
Author_Institution :
Electr. & Comput. Eng., Univ. of Texas at Austin, TX, USA
fYear :
2005
fDate :
27-30 Nov. 2005
Abstract :
We introduce a robust and efficient framework called CLUMP (CLustering Using Multiple Prototypes) for unsupervised discovery of structure in data. CLUMP relies on finding multiple prototypes that summarize the data. Clustering the prototypes enables our algorithm to scale up to extremely large and high-dimensional domains such as text data. Other desirable properties include robustness to noise and parameter choices. In this paper, we describe the approach in detail, characterize its performance on a variety of datasets, and compare it to some existing model selection approaches.
Keywords :
data mining; pattern clustering; text analysis; CLUMP; CLustering Using Multiple Prototypes; model selection; scalable robust framework; structure discovery; unsupervised discovery; Clustering algorithms; Data mining; Knee; Merging; Noise robustness; Noise shaping; Prototypes; Scalability; Statistics; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
ISSN :
1550-4786
Print_ISBN :
0-7695-2278-5
Type :
conf
DOI :
10.1109/ICDM.2005.43
Filename :
1565775
Link To Document :
بازگشت