• DocumentCode
    2745728
  • Title

    Gene selection using biological knowledge and fuzzy clustering

  • Author

    Ghosh, Sampreeti ; Mitra, Sushmita

  • Author_Institution
    Machine Intell. Unit, Indian Stat. Inst., India
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    1
  • Lastpage
    9
  • Abstract
    Gene expression data being high-dimensional and redundant, dimensionality reduction is of prime concern. We employ the algorithm Fuzzy Clustering Large Applications based on RAN-domized Search (FCLARANS) for attribute clustering and dimensionality reduction based on the study of gene ontology and differential gene expressions. The use of domain knowledge helps in the automated selection of biologically meaningful partitions. The use of Gene Ontology (GO) study helps in detecting biologically enriched and statistically significant clusters. Fold-change is measured to select the differentially expressed genes as the representatives of these clusters. Tools like Eisen plot and cluster profiles of these clusters help establish their coherence. Important representative features (or genes) are extracted from each enriched gene partitions to form the reduced gene space. While the reduced gene set forms a biologically meaningful gene space it simultaneously leads to a decrease in computational burden. External validation of the reduced subspace, using various well-known classifiers, establishes the effectiveness of the proposed methodology.
  • Keywords
    bioinformatics; feature extraction; fuzzy set theory; ontologies (artificial intelligence); pattern classification; pattern clustering; Eisen plot; FCLARANS; GO; attribute clustering; biological knowledge; biologically meaningful partitions; computational burden; differential gene expressions; dimensionality reduction; domain knowledge; fuzzy clustering; fuzzy clustering large applications based on RAN-domized search; gene expression data; gene ontology; gene selection; gene space reduction; representative features extraction; statistically significant clusters; Cancer; Clustering algorithms; Colon; Gene expression; Ontologies; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ-IEEE), 2012 IEEE International Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4673-1507-4
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZ-IEEE.2012.6250797
  • Filename
    6250797