DocumentCode
1728101
Title
Melanoma prediction using data mining system LERS
Author
Grzymala-Busse, Jan P. ; Grzymala-Busse, Jerzy W. ; Hippe, Zdzislaw S.
Author_Institution
Dept. of Electr. Eng. & Comput. Sci., Kansas Univ., Lawrence, KS, USA
fYear
2001
fDate
6/23/1905 12:00:00 AM
Firstpage
615
Lastpage
620
Abstract
One of the important tools for early diagnosis of malignant melanoma is the total dermatoscopy score (TDS), computed using the ABCD (asymmetry, border, color, diameter) formula. Our primary objective was to check whether the ABCD formula is optimal. Using a data set containing 276 cases of melanoma and the LERS (Learning from Examples based on Rough Sets) data mining system, we checked more than 20,000 modified formulas for ABCD, computing the predicted error rate of melanoma diagnosis using 10-fold cross-validation for every modified formula. As a result, we found the optimal ABCD formula for our setup: discretization based on cluster analysis, the LEM2 (Learning from Examples Module, version 2) algorithm (one of the four LERS algorithms for rule induction) and the standard LERS classification scheme. The error rate for the standard ABCD formula was 10.21 %, while for the optimal ABCD formula the error rate was reduced to 6.04%. Some research in melanoma diagnosis shows that the use of the ABCD formula does not improve the error rate. Our research shows that the ABCD formula is useful, since, for our data set, the error rate without the use of the ABCD formula was higher (13.73%)
Keywords
cancer; data mining; image classification; learning by example; medical diagnostic computing; medical expert systems; medical image processing; pattern clustering; skin; ABCD formula optimality; LEM2 algorithm; LERS classification scheme; LERS data mining system; asymmetry; border; cluster analysis; color; cross-validation; diameter; discretization; error rate prediction; learning from examples; malignant melanoma diagnosis; melanoma prediction; rough sets; rule induction; skin cancer; total dermatoscopy score; Algorithm design and analysis; Cancer; Classification algorithms; Clustering algorithms; Computer science; Data mining; Error analysis; Lesions; Malignant tumors; Systems engineering and theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Software and Applications Conference, 2001. COMPSAC 2001. 25th Annual International
Conference_Location
Chicago, IL
ISSN
0730-3157
Print_ISBN
0-7695-1372-7
Type
conf
DOI
10.1109/CMPSAC.2001.960676
Filename
960676
Link To Document