DocumentCode :
3129728
Title :
Identifying HotSpots in Lung Cancer Data Using Association Rule Mining
Author :
Agrawal, Ankit ; Choudhary, Alok
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL, USA
fYear :
2011
fDate :
11-11 Dec. 2011
Firstpage :
995
Lastpage :
1002
Abstract :
We analyze the lung cancer data available from the SEER program with the aim of identifying hotspots using association rule mining techniques. A subset of 13 patient attributes from the SEER data were recently linked with the survival outcome using prediction models, which is used in this study for segmentation. The goal here is to identify characteristics of patient segments where average survival is significantly higher/lower than average survival across the entire dataset. Automated association rule mining techniques resulted in hundreds of rules, from which many redundant rules were manually removed based on domain knowledge. The resulting rules conform with existing biomedical knowledge and provide interesting insights into lung cancer survival.
Keywords :
cancer; data mining; lung; medical information systems; HotSpots identification; SEER program; association rule mining; automated association rule mining techniques; biomedical knowledge; lung cancer data; patient segments; Association rules; Cancer; Lungs; Lymph nodes; Surgery; Tumors; Association rule mining; hotspots; lung cancer;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
Type :
conf
DOI :
10.1109/ICDMW.2011.93
Filename :
6137489
Link To Document :
بازگشت