DocumentCode
1122689
Title
On the Comparison of Conceptual Clustering and Numerical Taxonomy
Author
Dale, Michael B.
Author_Institution
CSIRO Division of Computing Research, St. Lucia, Australia 4067.
Issue
2
fYear
1985
fDate
3/1/1985 12:00:00 AM
Firstpage
241
Lastpage
244
Abstract
Conceptual clustering requires that clusters formed in the process shall be definable in terms of simple formulas in the predicate calculus. Michalski and Stepp [1] have argued that the results obtained with this method are clearly superior to traditional methods of numerical classification, so that an order of magnitude degradation in performance is acceptable. In this paper the results and comparisons presented by Michalski and Stepp are reviewed and shown to be less than adequate to support such a conclusion. There are considerable problems with data coding and standardization, as well as choice of similarity measure, that make the results difficult to evaluate. Even accepting the clusters, two different valorizing schemes are used to evaluate the results obtained. In addition, the traditional agglomerative algorithm employed in numerical classification procedures can be adapted to perform conceptual clustering without an enormous degradation in performance. However, the value of the method can only be regarded as unproven.
Keywords
Calculus; Clustering algorithms; Control systems; Data analysis; Degradation; Humans; Performance analysis; Standardization; Taxonomy; Unsupervised learning; Classification theory; cluster theory; comparison of cluster; conceptual clustering; data analysis; numerical taxonomy; unsupervised learning; validation;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.1985.4767647
Filename
4767647
Link To Document