• DocumentCode
    624497
  • Title

    A stable gene selection method based on sample weighting

  • Author

    Guoyin Wang ; Gao, J. ; Feng Hu

  • Author_Institution
    Chongqing Key Lab. of Comput. Intell., Chongqing Univ. of Posts & Telecommun., Chongqing, China
  • fYear
    2013
  • fDate
    5-8 May 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In microarray analysis, the selection of informative gene is an essential issue for tissue classification and successful treatment because of its ability to improve the accuracy and decrease computational complexity. However, the gene subsets selected by the same method often vary significantly with some variations of the samples in the same data set. Thus, the stability of the selected genes is an aspect as important as the predictive ability to quantify a feature selection algorithm. This work is an attempt to improve the stability of feature selection methods from the point of view of sample importance. We propose an efficient mean deviation-based sample weighting algorithm to improve the stability of feature selection methods by assigning a weight to each sample according to the mean deviation of its local value in giving samples. We perform a series of experiments with four frequently studied public data sets, and the experimental results validate that the proposed method improves the stability of common feature selection algorithms such as ReliefF without sacrificing the classification performance. Furthermore, more stable gene feature sets obtained by the proposed method than the state-of-the-art ensemble method and margin-based sample weighting algorithm.
  • Keywords
    biological tissues; feature extraction; genetics; importance sampling; lab-on-a-chip; pattern classification; stability; feature selection algorithm; feature selection method stability; gene subset; informative gene selection; mean deviation-based sample weighting algorithm; microarray analysis; sample importance; sample weighting; stable gene selection method; tissue classification; Feature extraction; Gene expression; Indexes; Prediction algorithms; Stability criteria; Training; Classification performance; Feature selection; Gene expression microarray; Stability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering (CCECE), 2013 26th Annual IEEE Canadian Conference on
  • Conference_Location
    Regina, SK
  • ISSN
    0840-7789
  • Print_ISBN
    978-1-4799-0031-2
  • Electronic_ISBN
    0840-7789
  • Type

    conf

  • DOI
    10.1109/CCECE.2013.6567792
  • Filename
    6567792