• DocumentCode
    3590410
  • Title

    Iterative weighted k-NN for constructing missing feature values in Wisconsin breast cancer dataset

  • Author

    Ashraf, Mohammad ; Le, Kim ; Huang, Xu

  • Author_Institution
    ISE, Univ. of Canberra, Bruce, ACT, Australia
  • fYear
    2011
  • Firstpage
    23
  • Lastpage
    27
  • Abstract
    This paper presents a new approach for constructing missing feature values based on iterative nearest neighbors and distance metrics. The proposed approach employs weighted k nearest neighbors´ algorithm and propagating the classification accuracy to a certain threshold. The proposed method showed improvement of classification accuracy of 0.005 in the constructed dataset than the original dataset which contain some missing feature values. The maximum classification accuracy was 0.9698 on k=1. This work is a component from a research for an automated diagnosing for breast cancer. The main aim of the current paper is to prepare the dataset for mining process. Future work includes applying the proposed method on more datasets.
  • Keywords
    cancer; data mining; iterative methods; medical computing; patient diagnosis; pattern classification; Wisconsin breast cancer dataset; breast cancer diagnosis; classification accuracy; dataset mining process; distance metrics; iterative weighted k-NN; k-nearest neighbor; missing feature value construction; Accuracy; Breast cancer; Data mining; Euclidean distance; Training; Constructing Missing Features Values; Data Mining; Distance Metrics; Iterative k-NN; Wisconsin Breast Cancer Dataset;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining and Intelligent Information Technology Applications (ICMiA), 2011 3rd International Conference on
  • Print_ISBN
    978-1-4673-0231-9
  • Type

    conf

  • Filename
    6108393