DocumentCode
2334684
Title
An agglomerative hierarchical clustering using partial maximum array and incremental similarity computation method
Author
Jung, Sung Young ; Kim, Taek-Soo
Author_Institution
Machine Intelligence Group, LG Electron. Inst. of Technol., Seoul, South Korea
fYear
2001
fDate
2001
Firstpage
265
Lastpage
272
Abstract
As the tractable amount of data grows in the computer science area, fast clustering algorithms are required, because traditional clustering algorithms are not feasible for very large and high-dimensional data. Many studies have been reported on the clustering of large databases, but most of them circumvent this problem by using an approximation method, resulting in the deterioration of accuracy. In this paper, we propose a new clustering algorithm by means of a partial maximum array, which can realize agglomerative hierarchical clustering with the same accuracy as the brute-force algorithm and has O(N2 ) time complexity. We also present an incremental method of similarity computation which substitutes a scalar calculation for the time-consuming calculation of vector similarity. Experimental results show that clustering becomes significantly fast for large and high-dimensional data
Keywords
arrays; computational complexity; data mining; data structures; database theory; pattern clustering; very large databases; accuracy deterioration; agglomerative hierarchical clustering; approximation method; fast clustering algorithm; high-dimensional data; incremental similarity computation method; large data sets; large databases; partial maximum array; scalar calculation; time complexity; Approximation methods; Clustering algorithms; Clustering methods; Computational efficiency; Computer science; Databases; Iterative algorithms; Machine intelligence; Merging; Nearest neighbor searches;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location
San Jose, CA
Print_ISBN
0-7695-1119-8
Type
conf
DOI
10.1109/ICDM.2001.989528
Filename
989528
Link To Document