• DocumentCode
    2966131
  • Title

    Prominent label identification and multi-label classification for cancer prognosis prediction

  • Author

    Saleema, J.S. ; Sairam, B. ; Naveen, S.D. ; Yuvaraj, K. ; Patnaik, L.M.

  • Author_Institution
    Dept. of Comp.Sc., Christ Univ., Bangalore, India
  • fYear
    2012
  • fDate
    19-22 Nov. 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Cancer prognosis prediction improves the quality of treatment and increases the survivability of the patients. Conventional methods of cancer prediction deal with single class by limiting the prognosis prediction to one response variable. The SEER Public Use cancer database has more prominent variables that support better prediction approach. The objective of this paper is to find the prominent labels from cancer databases and use them in a multi-class environment. The implementation consist of three phases namely, pre-processing, prominent label identification and multi-label classification. Breast, Colorectal and Respiratory Cancer Data sets have been used for the experimentation. Also random samples from all three data sets are generated to form a mixed cancer data. Patient survival, number of primaries and age at diagnosis are the prominent labels identified from others using the Decision tree, Naïve Bayes and KNN algorithms. The three prominent labels have been tested using multi-label RAkEL algorithm to find the relations between them. The results of the empirical study are comparatively better than the traditional way of cancer prediction.
  • Keywords
    Bayes methods; biological organs; cancer; decision trees; medical diagnostic computing; patient diagnosis; patient treatment; pattern classification; KNN algorithms; Naive Bayes algorithm; SEER public use cancer database; breast cancer data sets; cancer prognosis prediction; colorectal cancer data sets; decision tree algorithms; multilabel RAkEL algorithm; multilabel classification; patient diagnosis; patient survivability; patient treatment; prominent label identification; respiratory cancer data sets; Accuracy; Breast; Cancer; Classification algorithms; Data mining; Decision trees; Prediction algorithms; Classifier; Multi-label; Response Variable; SEER;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2012 - 2012 IEEE Region 10 Conference
  • Conference_Location
    Cebu
  • ISSN
    2159-3442
  • Print_ISBN
    978-1-4673-4823-2
  • Electronic_ISBN
    2159-3442
  • Type

    conf

  • DOI
    10.1109/TENCON.2012.6412321
  • Filename
    6412321