DocumentCode
3590410
Title
Iterative weighted k-NN for constructing missing feature values in Wisconsin breast cancer dataset
Author
Ashraf, Mohammad ; Le, Kim ; Huang, Xu
Author_Institution
ISE, Univ. of Canberra, Bruce, ACT, Australia
fYear
2011
Firstpage
23
Lastpage
27
Abstract
This paper presents a new approach for constructing missing feature values based on iterative nearest neighbors and distance metrics. The proposed approach employs weighted k nearest neighbors´ algorithm and propagating the classification accuracy to a certain threshold. The proposed method showed improvement of classification accuracy of 0.005 in the constructed dataset than the original dataset which contain some missing feature values. The maximum classification accuracy was 0.9698 on k=1. This work is a component from a research for an automated diagnosing for breast cancer. The main aim of the current paper is to prepare the dataset for mining process. Future work includes applying the proposed method on more datasets.
Keywords
cancer; data mining; iterative methods; medical computing; patient diagnosis; pattern classification; Wisconsin breast cancer dataset; breast cancer diagnosis; classification accuracy; dataset mining process; distance metrics; iterative weighted k-NN; k-nearest neighbor; missing feature value construction; Accuracy; Breast cancer; Data mining; Euclidean distance; Training; Constructing Missing Features Values; Data Mining; Distance Metrics; Iterative k-NN; Wisconsin Breast Cancer Dataset;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining and Intelligent Information Technology Applications (ICMiA), 2011 3rd International Conference on
Print_ISBN
978-1-4673-0231-9
Type
conf
Filename
6108393
Link To Document