Title :
Cluster and Clustering Algorithm Validity in Image Retrieval
Author :
Cheng-Yuan Tang ; Yi-Leh Wu ; Yung-Chieh Lee
Author_Institution :
Huafan Univ., Taipei
Abstract :
To improve query efficiency, most image retrieval systems utilize clustering algorithms to build indices on image databases. In this paper, we analysis region-based image retrieval (RBIR) system query results, hypothesis test and Hubert´s Gamma statistic for cluster validation. Our experiment results suggest that images originated from different categories do form clusters in the feature space and thus can be separated. We then analysis the performance of clustering algorithms including K-means, fuzzy C-mean, CA-clustering, density based spatial clustering of applications with noise (DBSCAN) and the proposed modified DBSCAN algorithm with a second merging phase. Our experiment results suggest that the proposed algorithm has clustering performance among the best algorithms. And the proposed modified DBSCAN algorithm with a second merging phase can avoid some of the drawbacks (e.g., k random initial cluster centers) in the original K-means algorithms.
Keywords :
content-based retrieval; database indexing; feature extraction; fuzzy set theory; image retrieval; pattern clustering; visual databases; CA-clustering; Hubert Gamma statistic; K-means clustering; cluster validation; clustering algorithm; density based spatial clustering; feature space; fuzzy C-mean clustering; image databases; index building; query efficiency; region-based image retrieval; Algorithm design and analysis; Clustering algorithms; Image analysis; Image databases; Image retrieval; Information retrieval; Merging; Performance analysis; Statistical analysis; System testing; Cluster Validity; Clustering Algorithm; Image Retrieval;
Conference_Titel :
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
1-4244-0099-6
DOI :
10.1109/ICSMC.2006.384630