• DocumentCode
    813924
  • Title

    Iterative RELIEF for Feature Weighting: Algorithms, Theories, and Applications

  • Author

    Sun, Yijun

  • Author_Institution
    Interdisciplinary Center for Biotechnology Res., Florida Univ., Gainesville, FL
  • Volume
    29
  • Issue
    6
  • fYear
    2007
  • fDate
    6/1/2007 12:00:00 AM
  • Firstpage
    1035
  • Lastpage
    1051
  • Abstract
    RELIEF is considered one of the most successful algorithms for assessing the quality of features. In this paper, we propose a set of new feature weighting algorithms that perform significantly better than RELIEF, without introducing a large increase in computational complexity. Our work starts from a mathematical interpretation of the seemingly heuristic RELIEF algorithm as an online method solving a convex optimization problem with a margin-based objective function. This interpretation explains the success of RELIEF in real application and enables us to identify and address its following weaknesses. RELIEF makes an implicit assumption that the nearest neighbors found in the original feature space are the ones in the weighted space and RELIEF lacks a mechanism to deal with outlier data. We propose an iterative RELIEF (I-RELIEF) algorithm to alleviate the deficiencies of RELIEF by exploring the framework of the expectation-maximization algorithm. We extend I-RELIEF to multiclass settings by using a new multiclass margin definition. To reduce computational costs, an online learning algorithm is also developed. Convergence analysis of the proposed algorithms is presented. The results of large-scale experiments on the UCI and microarray data sets are reported, which demonstrate the effectiveness of the proposed algorithms, and verify the presented theoretical results
  • Keywords
    computational complexity; expectation-maximisation algorithm; feature extraction; computational complexity; convex optimization; expectation-maximization algorithm; feature weighting; iterative RELIEF; Algorithm design and analysis; Computational complexity; Computational efficiency; Convergence; Expectation-maximization algorithms; Heuristic algorithms; Iterative algorithms; Large-scale systems; Nearest neighbor searches; Optimization methods; DNA microarray; Feature weighting; RELIEF; classification.; feature selection; iterative algorithm; Algorithms; Artificial Intelligence; Cluster Analysis; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Information Storage and Retrieval; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2007.1093
  • Filename
    4160953