Title :
Rough sets used in the measurement of similarity of mixed mode data
Author :
Coppock, Sarah ; Mazlack, Lawrence
Author_Institution :
ECECS Dept., Cincinnati Univ., OH, USA
Abstract :
Similarity is important in knowledge discovery. Cluster analysis, classification, and granulation each involve some notion or definition of similarity. The measurement of similarity is selected based on the domain and distribution of the data. Even within a specific domain, some similarity metrics may be considered more useful than others. There is an amount of uncertainty in quantitatively measuring the similarity between records of mixed data. The uncertainty develops from the lack of scale that both nominal and ordinal data have. Rough set theory is one tool developed for handling uncertainty. Rough sets can be used in dissimilarity analysis of qualitative data. It would seem that rough sets could be applied in measuring similarity between records containing both quantitative and qualitative data for the purpose of clustering the records.
Keywords :
data mining; pattern clustering; rough set theory; uncertainty handling; cluster analysis; knowledge discovery; mixed data; mixed mode data; rough set theory; similarity measurement; similarity metrics; uncertainty handling; Artificial intelligence; Cities and towns; Clustering algorithms; Data analysis; Frequency; Laboratories; Measurement uncertainty; Rough sets; Set theory; Unsupervised learning;
Conference_Titel :
Fuzzy Information Processing Society, 2003. NAFIPS 2003. 22nd International Conference of the North American
Print_ISBN :
0-7803-7918-7
DOI :
10.1109/NAFIPS.2003.1226781