Title :
Bayesian Feedback in Data Clustering
Author :
Jain, A.K. ; Mallapragada, Pavan K. ; Law, Martin
Author_Institution :
Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI
Abstract :
In many clustering applications, the user has some vague notion of the number and membership of the desired clusters. However, it is difficult for the user to provide such knowledge explicitly in the clustering process. We propose a solution to circumvent this difficulty by introducing a feedback mechanism. The notion of Bayesian inference for relevance feedback in content-based image retrieval is modified for data clustering. Given the number of clusters, the proposed algorithm seeks information about the target partition by asking the user a sequence of queries about whether a pair of objects should be put in the same cluster or not. Information-theoretic criteria is adopted to select the queries to be presented to the user. The assumption made here is that cluster labels are "smooth", i.e., similar objects should share the same cluster labels. We show that it is possible to obtain reasonable partitions based on the user feedback alone, without the need of specifying a clustering objective function
Keywords :
belief networks; inference mechanisms; information theory; pattern clustering; relevance feedback; Bayesian feedback; Bayesian inference; data clustering; information-theoretic criteria; relevance feedback; Application software; Bayesian methods; Clustering algorithms; Computer science; Content based retrieval; Feedback; Image retrieval; Inference algorithms; Information retrieval; Partitioning algorithms;
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.316