• DocumentCode
    2010908
  • Title
    Gene Ontology Driven Feature Selection from Microarray Gene Expression Data
  • Author
    Qi, Jianlong; Tang, Jian
  • Author_Institution
    Dept. of Comput. Sci., Memorial Univ. of Newfoundland, St. John's, Nfld.
  • fYear
    2006
  • fDate
    28-29 Sept. 2006
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    One of the main challenges in classifying microarray gene expression data is the small sample size relative to the large number of genes, so feature selection is an essential step for removing genes that are not relevant to the class label. Traditional gene selection methods often select the top-ranked genes based on their individual discriminative power. The problem with these simple ranking models is that they evaluate genes in isolation, which may introduce redundancy into the selected feature subset. Most redundancy-based methods evaluate gene expression levels alone. This may decrease the effectiveness of feature selection, since some expression values may not be accurately measured. In this paper, we propose a gene ontology based method for feature selection. The novelty of this model is that it detects redundancy between a pair of genes using a convex combination of their expression similarity and their semantic similarity in the gene ontology. The effectiveness of our method is demonstrated by experiments on two widely used datasets.
  • Keywords
    biology computing; feature extraction; genetics; medical image processing; ontologies (artificial intelligence); feature selection; gene ontology; microarray gene expression data; Computational efficiency; Computer science; Data analysis; Data mining; Gene expression; Labeling; Measurement standards; Nearest neighbor searches; Ontologies; Statistics
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Intelligence and Bioinformatics and Computational Biology, 2006. CIBCB '06. 2006 IEEE Symposium on
  • Conference_Location
    Toronto, Ont.
  • Print_ISBN
    1-4244-0624-2
  • Electronic_ISBN
    1-4244-0624-2
  • Type
    conf
  • DOI
    10.1109/CIBCB.2006.330968
  • Filename
    4133204
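
The redundancy measure described in the abstract, a convex combination of expression similarity and semantic similarity, can be illustrated with a minimal sketch. The abstract does not say which similarity measures or which weight the authors use, so everything specific here is an assumption: expression similarity is taken as the absolute Pearson correlation, the semantic similarity is assumed to be a precomputed score in [0, 1], and the names `pearson`, `combined_redundancy`, and `alpha` are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant profile carries no correlation signal
    return cov / sqrt(var_x * var_y)


def combined_redundancy(expr_a, expr_b, semantic_sim, alpha=0.5):
    """Convex combination of expression and semantic similarity.

    Assumptions (not taken from the paper): expression similarity is
    |Pearson correlation|, semantic_sim is a precomputed Gene Ontology
    similarity in [0, 1], and alpha in [0, 1] weights the expression term.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if not 0.0 <= semantic_sim <= 1.0:
        raise ValueError("semantic_sim must lie in [0, 1]")
    expression_sim = abs(pearson(expr_a, expr_b))
    return alpha * expression_sim + (1.0 - alpha) * semantic_sim


if __name__ == "__main__":
    gene_a = [1.0, 2.0, 3.0, 4.0]
    gene_b = [2.0, 4.0, 6.0, 8.0]
    print(combined_redundancy(gene_a, gene_b, semantic_sim=0.2, alpha=0.5))
```

Under these assumptions, `alpha = 1` reduces the score to expression similarity alone and `alpha = 0` to semantic similarity alone; intermediate values let the ontology term compensate for noisy expression measurements.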