• Title of article

    Microarray gene expression classification with few genes: Criteria to combine attribute selection and classification methods

  • Author/Authors

    Alonso-Gonzلlez، نويسنده , , Carlos J. and Moro-Sancho، نويسنده , , Q. Isaac and Simon-Hurtado، نويسنده , , Arancha and Varela-Arrabal، نويسنده , , Ricardo، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2012
  • Pages
    11
  • From page
    7270
  • To page
    7280
  • Abstract
    Microarray data classification is a task involving high dimensionality and small samples sizes. A common criterion to decide on the number of selected genes is maximizing the accuracy, which risks overfitting and usually selects more genes than actually needed. We propose, relaxing the maximum accuracy criterion, to select the combination of attribute selection and classification algorithm that using less attributes has an accuracy not statistically significantly worst that the best. Also we give some advice to choose a suitable combination of attribute selection and classifying algorithms for a good accuracy when using a low number of gene expressions. We used some well known attribute selection methods (FCBF, ReliefF and SVM-RFE, plus a Random selection, used as a base line technique) and classifying techniques (Naive Bayes, 3 Nearest Neighbor and SVM with linear kernel) applied to 30 data sets involving different cancer types.
  • Keywords
    Microarray data classification , feature selection , Efficient classification with few genes , Machine Learning
  • Journal title
    Expert Systems with Applications
  • Serial Year
    2012
  • Journal title
    Expert Systems with Applications
  • Record number

    2351925