Title :
Iterative weighted k-NN for constructing missing feature values in Wisconsin breast cancer dataset
Author :
Ashraf, Mohammad ; Le, Kim ; Huang, Xu
Author_Institution :
ISE, Univ. of Canberra, Bruce, ACT, Australia
Abstract :
This paper presents a new approach for constructing missing feature values based on iterative nearest neighbors and distance metrics. The proposed approach employs weighted k nearest neighbors´ algorithm and propagating the classification accuracy to a certain threshold. The proposed method showed improvement of classification accuracy of 0.005 in the constructed dataset than the original dataset which contain some missing feature values. The maximum classification accuracy was 0.9698 on k=1. This work is a component from a research for an automated diagnosing for breast cancer. The main aim of the current paper is to prepare the dataset for mining process. Future work includes applying the proposed method on more datasets.
Keywords :
cancer; data mining; iterative methods; medical computing; patient diagnosis; pattern classification; Wisconsin breast cancer dataset; breast cancer diagnosis; classification accuracy; dataset mining process; distance metrics; iterative weighted k-NN; k-nearest neighbor; missing feature value construction; Accuracy; Breast cancer; Data mining; Euclidean distance; Training; Constructing Missing Features Values; Data Mining; Distance Metrics; Iterative k-NN; Wisconsin Breast Cancer Dataset;
Conference_Titel :
Data Mining and Intelligent Information Technology Applications (ICMiA), 2011 3rd International Conference on
Print_ISBN :
978-1-4673-0231-9