• DocumentCode
    1728101
  • Title

    Melanoma prediction using data mining system LERS

  • Author

    Grzymala-Busse, Jan P. ; Grzymala-Busse, Jerzy W. ; Hippe, Zdzislaw S.

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Kansas Univ., Lawrence, KS, USA
  • fYear
    2001
  • fDate
    6/23/1905 12:00:00 AM
  • Firstpage
    615
  • Lastpage
    620
  • Abstract
    One of the important tools for early diagnosis of malignant melanoma is the total dermatoscopy score (TDS), computed using the ABCD (asymmetry, border, color, diameter) formula. Our primary objective was to check whether the ABCD formula is optimal. Using a data set containing 276 cases of melanoma and the LERS (Learning from Examples based on Rough Sets) data mining system, we checked more than 20,000 modified formulas for ABCD, computing the predicted error rate of melanoma diagnosis using 10-fold cross-validation for every modified formula. As a result, we found the optimal ABCD formula for our setup: discretization based on cluster analysis, the LEM2 (Learning from Examples Module, version 2) algorithm (one of the four LERS algorithms for rule induction) and the standard LERS classification scheme. The error rate for the standard ABCD formula was 10.21 %, while for the optimal ABCD formula the error rate was reduced to 6.04%. Some research in melanoma diagnosis shows that the use of the ABCD formula does not improve the error rate. Our research shows that the ABCD formula is useful, since, for our data set, the error rate without the use of the ABCD formula was higher (13.73%)
  • Keywords
    cancer; data mining; image classification; learning by example; medical diagnostic computing; medical expert systems; medical image processing; pattern clustering; skin; ABCD formula optimality; LEM2 algorithm; LERS classification scheme; LERS data mining system; asymmetry; border; cluster analysis; color; cross-validation; diameter; discretization; error rate prediction; learning from examples; malignant melanoma diagnosis; melanoma prediction; rough sets; rule induction; skin cancer; total dermatoscopy score; Algorithm design and analysis; Cancer; Classification algorithms; Clustering algorithms; Computer science; Data mining; Error analysis; Lesions; Malignant tumors; Systems engineering and theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Software and Applications Conference, 2001. COMPSAC 2001. 25th Annual International
  • Conference_Location
    Chicago, IL
  • ISSN
    0730-3157
  • Print_ISBN
    0-7695-1372-7
  • Type

    conf

  • DOI
    10.1109/CMPSAC.2001.960676
  • Filename
    960676