DocumentCode
2966131
Title
Prominent label identification and multi-label classification for cancer prognosis prediction
Author
Saleema, J.S. ; Sairam, B. ; Naveen, S.D. ; Yuvaraj, K. ; Patnaik, L.M.
Author_Institution
Dept. of Comp.Sc., Christ Univ., Bangalore, India
fYear
2012
fDate
19-22 Nov. 2012
Firstpage
1
Lastpage
6
Abstract
Cancer prognosis prediction improves the quality of treatment and increases the survivability of the patients. Conventional methods of cancer prediction deal with single class by limiting the prognosis prediction to one response variable. The SEER Public Use cancer database has more prominent variables that support better prediction approach. The objective of this paper is to find the prominent labels from cancer databases and use them in a multi-class environment. The implementation consist of three phases namely, pre-processing, prominent label identification and multi-label classification. Breast, Colorectal and Respiratory Cancer Data sets have been used for the experimentation. Also random samples from all three data sets are generated to form a mixed cancer data. Patient survival, number of primaries and age at diagnosis are the prominent labels identified from others using the Decision tree, Naïve Bayes and KNN algorithms. The three prominent labels have been tested using multi-label RAkEL algorithm to find the relations between them. The results of the empirical study are comparatively better than the traditional way of cancer prediction.
Keywords
Bayes methods; biological organs; cancer; decision trees; medical diagnostic computing; patient diagnosis; patient treatment; pattern classification; KNN algorithms; Naive Bayes algorithm; SEER public use cancer database; breast cancer data sets; cancer prognosis prediction; colorectal cancer data sets; decision tree algorithms; multilabel RAkEL algorithm; multilabel classification; patient diagnosis; patient survivability; patient treatment; prominent label identification; respiratory cancer data sets; Accuracy; Breast; Cancer; Classification algorithms; Data mining; Decision trees; Prediction algorithms; Classifier; Multi-label; Response Variable; SEER;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2012 - 2012 IEEE Region 10 Conference
Conference_Location
Cebu
ISSN
2159-3442
Print_ISBN
978-1-4673-4823-2
Electronic_ISBN
2159-3442
Type
conf
DOI
10.1109/TENCON.2012.6412321
Filename
6412321
Link To Document