Title :
Postprocessing of rule sets induced from a melanoma data set
Author :
Grzymala-Busse, Jerzy W. ; Hippe, Zdzislaw S.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Kansas Univ., Lawrence, KS, USA
Abstract :
The data mining system LERS (learning from examples based on rough sets) was used to induce rule sets from a data set describing melanoma (a dangerous skin cancer). The main objective of our research was to decrease the error rates for diagnosis of two fatal forms of melanoma based on these rule sets. The improvement was accomplished using two different techniques for postprocessing of rule sets: changing of rule strengths and rule truncation cutoffs. A rule strength is defined as the number of training cases correctly, classified by the rule. Rule truncation means an elimination of weaker rules. The criterion for the choice of the optimal form of the rule sets was the minimum of the sum of error rates for diagnosis of the two fatal forms of melanoma. Our research shows that at the cost of a minimal increase of the total error rate for patients that do not need immediate help, the sum of error rates for dangerous forms of melanoma may, be highly decreased. Also, for the optimal rule set, the sum of error rates for all forms of melanoma is minimal as well.
Keywords :
data mining; knowledge based systems; learning by example; medical computing; optimisation; pattern classification; rough set theory; data mining system; learning from examples; medical computing; melanoma data set; optimisation; pattern classification; rough set theory; rule set processing; rule truncation; Artificial intelligence; Computer science; Data engineering; Diseases; Error analysis; Expert systems; Malignant tumors; Medical diagnostic imaging; Skin cancer; Systems engineering and theory;
Conference_Titel :
Computer Software and Applications Conference, 2002. COMPSAC 2002. Proceedings. 26th Annual International
Print_ISBN :
0-7695-1727-7
DOI :
10.1109/CMPSAC.2002.1045166