DocumentCode :
1380043
Title :
Assessment of the Risk Factors of Coronary Heart Events Based on Data Mining With Decision Trees
Author :
KARAOLIS, MINAS A. ; Moutiris, Joseph A. ; Hadjipanayi, Demetra ; Pattichis, Constantinos S.
Author_Institution :
Dept. of Comput. Sci., Univ. of Cyprus, Nicosia, Cyprus
Volume :
14
Issue :
3
fYear :
2010
fDate :
5/1/2010 12:00:00 AM
Firstpage :
559
Lastpage :
566
Abstract :
Coronary heart disease (CHD) is one of the major causes of disability in adults as well as one of the main causes of death in the developed countries. Although significant progress has been made in the diagnosis and treatment of CHD, further investigation is still needed. The objective of this study was to develop a data-mining system for the assessment of heart event-related risk factors targeting in the reduction of CHD events. The risk factors investigated were: 1) before the event: a) nonmodifiable-age, sex, and family history for premature CHD, b) modifiable-smoking before the event, history of hypertension, and history of diabetes; and 2) after the event: modifiable-smoking after the event, systolic blood pressure, diastolic blood pressure, total cholesterol, high-density lipoprotein, low-density lipoprotein, triglycerides, and glucose. The events investigated were: myocardial infarction (MI), percutaneous coronary intervention (PCI), and coronary artery bypass graft surgery (CABG). A total of 528 cases were collected from the Paphos district in Cyprus, most of them with more than one event. Data-mining analysis was carried out using the C4.5 decision tree algorithm for the aforementioned three events using five different splitting criteria. The most important risk factors, as extracted from the classification rules analysis were: 1) for MI, age, smoking, and history of hypertension; 2) for PCI, family history, history of hypertension, and history of diabetes; and 3) for CABG, age, history of hypertension, and smoking. Most of these risk factors were also extracted by other investigators. The highest percentages of correct classifications achieved were 66%, 75%, and 75% for the MI, PCI, and CABG models, respectively. It is anticipated that data mining could help in the identification of high and low risk subgroups of subjects, a decisive factor for the selection of therapy, i.e., medical or surgical. However, further investigation with larger datasets is still- - needed.
Keywords :
bioinformatics; cardiology; data mining; decision trees; diseases; patient diagnosis; patient treatment; risk analysis; CHD diagnosis; CHD treatment; Cyprus; Paphos district; adult disability; coronary artery bypass graft surgery; coronary heart disease; data mining; death; decision tree; diabetes; diastolic blood pressure; glucose; high density lipoprotein; hypertension; low density lipoprotein; myocardial infarction; percutaneous coronary intervention; risk factor; smoking; systolic blood pressure; total cholesterol; triglyceride; Coronary heart disease (CHD); data mining; decision trees; risk factors; Adult; Aged; Aged, 80 and over; Algorithms; Coronary Disease; Data Mining; Decision Trees; Female; Humans; Male; Middle Aged; Models, Cardiovascular; Risk Factors;
fLanguage :
English
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
Publisher :
ieee
ISSN :
1089-7771
Type :
jour
DOI :
10.1109/TITB.2009.2038906
Filename :
5378501
Link To Document :
بازگشت