Title :
Indiscernibility degree of objects for evaluating simplicity of knowledge in the clustering procedure
Author :
Hirano, Shoji ; Tsumoto, Shusaku
Author_Institution :
Dept. of Med. Informatics, Shimane Med. Univ., Izumo, Japan
Abstract :
The paper presents a novel, rough set-based clustering method that enables the evaluation of classification knowledge simplicity during the clustering procedure. The method iteratively refines equivalence relations so that they become a more simple set of relations that give adequate coarse classification to the objects. At each step of the iteration, the importance of the equivalence relation is evaluated on the basis of the newly introduced measure, indiscernibility degree. An indiscernibility degree is defined as a ratio of equivalence relations that classify the two objects into the same equivalence class. If an equivalence relation has the ability to discern two objects that have a high indiscernibility degree, a very fine classification is performed and then modified to regard them as indiscernible objects. The refinement is repeated, decreasing the threshold level of indiscernibility degree, and finally simple clusters can be obtained. Experimental results on the artificial data shows that iterative refinement of equivalence relation leads to successful generation of coarse clusters that can be represented by simple knowledge
Keywords :
data mining; equivalence classes; pattern clustering; rough set theory; very large databases; artificial data; classification knowledge simplicity evaluation; clustering procedure; coarse classification; coarse clusters; equivalence class; equivalence relation; equivalence relations; indiscernibility degree; iterative refinement; rough set-based clustering method; simple clusters; threshold level; Algorithm design and analysis; Biomedical informatics; Clustering algorithms; Clustering methods; Data analysis; Databases; Rough sets; Scalability; Sections; Set theory;
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989521