• DocumentCode
    3123933
  • Title

    Semi-supervised Feature Importance Evaluation with Ensemble Learning

  • Author

    Barkia, Hasna ; Elghazel, Haytham ; Aussem, Alex

  • Author_Institution
    Univ. de Lyon, Lyon, France
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    31
  • Lastpage
    40
  • Abstract
    We consider the problem of using a large amount of unlabeled data to improve the efficiency of feature selection in high dimensional datasets, when only a small set of labeled examples is available. We propose a new semi-supervised feature importance evaluation method (SSFI for short), that combines ideas from co-training and random forests with a new permutation-based out-of-bag feature importance measure. We provide empirical results on several benchmark datasets indicating that SSFI can lead to significant improvement over state-of-the-art semi-supervised and supervised algorithms.
  • Keywords
    learning (artificial intelligence); pattern classification; cotraining; ensemble learning; feature selection; high dimensional datasets; permutation-based out-of-bag feature importance measure; random forests; semisupervised feature importance evaluation; unlabeled data; Bagging; Labeling; Manifolds; Prediction algorithms; Radio frequency; Training; Vectors; Bagging; Co-training; Ensemble Method; Feature Selection; Random Subspaces Method; Semi-Supervised Learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver,BC
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4577-2075-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2011.129
  • Filename
    6137207