DocumentCode
2398362
Title
Data Mining a Prostate Cancer Dataset Using Rough Sets
Author
Revett, Kenneth ; De Magalhães, Sérgio Tenreiro ; Santos, Henrique M D
Author_Institution
Harrow Sch. of Comput. Sci., Westminster Univ., London
fYear
2006
fDate
Sept. 2006
Firstpage
290
Lastpage
293
Abstract
Prostate cancer remains one of the leading causes of cancer death worldwide, with a reported incidence rate of 650,000 cases per annum worldwide. The causal factors of prostate cancer still remain to be determined. In this paper, we investigate a medical dataset containing clinical information on 502 prostate cancer patients using the machine learning technique of rough sets. Our preliminary results yield a classification accuracy of 90%, with high sensitivity and specificity (both at approximately 91%). Our results yield a predictive positive value (PPN) of 81% and a predictive negative value (PNV) of 95%. In addition to the high classification accuracy of our system, the rough set approach also provides a rule-based inference mechanism for information extraction that is suitable for integration into a rule-based system. The generated rules relate directly to the attributes and their values and provide a direct mapping between them
Keywords
cancer; data mining; learning (artificial intelligence); medical information systems; rough set theory; cancer classifier; clinical information; data mining; information extraction; machine learning; prostate cancer; rough set; rule-based inference mechanism; Biochemistry; Cancer detection; Data mining; Environmental factors; Lungs; Machine learning; Oncological surgery; Prostate cancer; Rough sets; Testing; Rough sets; cancer classifier; machine learning; prostate cancer dataset; reducts;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems, 2006 3rd International IEEE Conference on
Conference_Location
London
Print_ISBN
1-4244-01996-8
Electronic_ISBN
1-4244-01996-8
Type
conf
DOI
10.1109/IS.2006.348433
Filename
4155440
Link To Document