Title :
Hierarchical Agglomerative Clustering with Ordering Constraints
Author :
Zhao, Haifeng ; Qi, ZiJie
Author_Institution :
Dept. of Comput. Sci., Univ. of California, Davis, Davis, CA, USA
Abstract :
Many previous researchers have converted background knowledge as constraints to obtain accurate clustering. These clustering methods are usually called constrained clustering. Previous ordering constraints are instance level non-hierarchical constraints, like must-link and cannot-link constraints, which do not provide hierarchical information. In order to incorporate the hierarchical background knowledge into agglomerative clustering, we extend instance-level constraint to hierarchical constraint in this paper. We name it as ordering constraint. Ordering constraints can be used to capture hierarchical side information and they allow the user to encode hierarchical knowledge such as ontologies into agglomerative algorithms. We experimented with ordering constraints on labeled newsgroup data. Experiments showed that the dendrogram generated by ordering constraints is more similar to the pre-known hierarchy than the dendrogram generated by previous agglomerative clustering algorithms. We believe this work will have a significant impact on the agglomerative clustering field.
Keywords :
constraint handling; knowledge representation; pattern clustering; constrained clustering; dendrogram; hierarchical agglomerative clustering algorithm; hierarchical background knowledge; hierarchical information; instance-level constraint; ontologies; ordering constraints; Clustering algorithms; Clustering methods; Computer science; Data mining; Delay; Merging; Ontologies; constrained clustering; hierarchical agglomerative clustering; ordering constraint;
Conference_Titel :
Knowledge Discovery and Data Mining, 2010. WKDD '10. Third International Conference on
Conference_Location :
Phuket
Print_ISBN :
978-1-4244-5397-9
Electronic_ISBN :
978-1-4244-5398-6
DOI :
10.1109/WKDD.2010.123