Title :
Diagnosis of Coronary Artery Disease Using Cost-Sensitive Algorithms
Author :
Alizadehsani, R. ; Hosseini, M.J. ; Sani, Zahra Alizadeh ; Ghandeharioun, A. ; Boghrati, R.
Author_Institution :
Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran, Iran
Abstract :
One of the main causes of death the world over are cardiovascular diseases, of which coronary artery disease (CAD) is a major type. This disease occurs when the diameter narrowing of one of the left anterior descending, left circumflex, or right coronary arteries is equal to or greater than 50 percent. Angiography is the principal diagnostic modality for the stenos is of heart vessels, however, because of its complications and costs, researchers are looking for alternative methods such as data mining. This study conducts data mining algorithms on the Z-Alizadeh Sani dataset which has been collected from 303 random visitors to Tehran´s Shaheed Rajaei Cardiovascular, Medical and Research Center. In this paper, the reason of effectiveness of a preprocessing algorithm on the dataset is investigated. This algorithm which has been merely introduced in our previous works, extracts three new features from the dataset. These features are then used to enrich the primary dataset in order to achieve more accurate results. Moreover, despite the fact that misclassification of diseased patients has more side effects than that of healthy ones, to the best of our knowledge cost-sensitive algorithms have yet to be used in this field. Therefore, in this paper 10-fold cross validation on cost-sensitive algorithms along with base classifiers of Naïve Bayes, Sequential Minimal Optimization (SMO), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and C4.5 were employed. As a result, the SMO algorithm has yield to very high sensitivity (97.22%) and accuracy (92.09%) rates, the likes of which have not been reported simultaneously in the existing literature.
Keywords :
Bayes methods; angiocardiography; blood vessels; cardiology; data mining; diseases; feature extraction; medical computing; minimisation; pattern classification; support vector machines; 10-fold cross validation; C4.5 classifiers; K-nearest neighbors; KNN classifier; Naive Bayes classifier; SMO algorithm; SVM; Tehran; Z-Alizadeh Sani dataset; angiography; cardiovascular diseases; coronary artery disease diagnosis; data mining algorithms; diagnostic modality; diseased patient misclassification; feature extraction; heart vessels; knowledge cost-sensitive algorithms; left anterior descending; left circumflex; right coronary arteries; sequential minimal optimization; stenosis; support vector machine; Accuracy; Arteries; Data mining; Design automation; Diseases; Sensitivity; Support vector machines; C4.5 algorithm; Coronary Artery Disease; Cost Sensitive Algorithms; Data Mining; Feature Extraction; Naïve Bayes algorithm;
Conference_Titel :
Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-5164-5
DOI :
10.1109/ICDMW.2012.29