DocumentCode :
2989966
Title :
Minimum description length and clustering with exemplars
Author :
Lai, Po-Hsiang ; Sullivan, Joseph A O ; Pless, Robert
Author_Institution :
Dept. of Electr. & Syst. Eng., Washington Univ. in St Louis, St. Louis, MO, USA
fYear :
2009
fDate :
June 28 2009-July 3 2009
Firstpage :
1318
Lastpage :
1322
Abstract :
We propose an information-theoretic clustering framework for density-based clustering and similarity or distance-based clustering with objective functions of clustering performance derived from stochastic complexity and minimum description length (MDL) arguments. Under this framework, the number of clusters and parameters can be determined in a principled way without prior knowledge from users. We show that similarity-based clustering can be viewed as combinatorial optimization on graphs. We propose two clustering algorithms, one of which relies on a minimum arborescence tree algorithm which returns optimal clustering under the proposed MDL objective function for similarity-based clustering. We demonstrate clustering performance on synthetic data.
Keywords :
optimisation; pattern clustering; stochastic processes; trees (mathematics); clustering performance objective functions; combinatorial optimization; density-based clustering; distance-based clustering; minimum arborescence tree algorithm; minimum description length; similarity-based clustering; stochastic complexity; synthetic data clustering performance; Artificial intelligence; Bioinformatics; Clustering algorithms; Clustering methods; Computer science; Learning; Partitioning algorithms; Stochastic systems; Systems engineering and theory; Tree graphs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory, 2009. ISIT 2009. IEEE International Symposium on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-4312-3
Electronic_ISBN :
978-1-4244-4313-0
Type :
conf
DOI :
10.1109/ISIT.2009.5205937
Filename :
5205937
Link To Document :
بازگشت