DocumentCode :
3590410
Title :
Iterative weighted k-NN for constructing missing feature values in Wisconsin breast cancer dataset
Author :
Ashraf, Mohammad ; Le, Kim ; Huang, Xu
Author_Institution :
ISE, Univ. of Canberra, Bruce, ACT, Australia
fYear :
2011
Firstpage :
23
Lastpage :
27
Abstract :
This paper presents a new approach for constructing missing feature values based on iterative nearest neighbors and distance metrics. The proposed approach employs weighted k nearest neighbors´ algorithm and propagating the classification accuracy to a certain threshold. The proposed method showed improvement of classification accuracy of 0.005 in the constructed dataset than the original dataset which contain some missing feature values. The maximum classification accuracy was 0.9698 on k=1. This work is a component from a research for an automated diagnosing for breast cancer. The main aim of the current paper is to prepare the dataset for mining process. Future work includes applying the proposed method on more datasets.
Keywords :
cancer; data mining; iterative methods; medical computing; patient diagnosis; pattern classification; Wisconsin breast cancer dataset; breast cancer diagnosis; classification accuracy; dataset mining process; distance metrics; iterative weighted k-NN; k-nearest neighbor; missing feature value construction; Accuracy; Breast cancer; Data mining; Euclidean distance; Training; Constructing Missing Features Values; Data Mining; Distance Metrics; Iterative k-NN; Wisconsin Breast Cancer Dataset;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining and Intelligent Information Technology Applications (ICMiA), 2011 3rd International Conference on
Print_ISBN :
978-1-4673-0231-9
Type :
conf
Filename :
6108393
Link To Document :
بازگشت