DocumentCode :
28652
Title :
Rough-Fuzzy Clustering for Grouping Functionally Similar Genes from Microarray Data
Author :
Maji, Pradipta ; Paul, Sudipta
Author_Institution :
Machine Intell. Unit, Indian Stat. Inst., Kolkata, India
Volume :
10
Issue :
2
fYear :
2013
fDate :
March-April 2013
Firstpage :
286
Lastpage :
299
Abstract :
Gene expression data clustering is one of the important tasks of functional genomics as it provides a powerful tool for studying functional relationships of genes in a biological process. Identifying coexpressed groups of genes represents the basic challenge in gene clustering problem. In this regard, a gene clustering algorithm, termed as robust rough-fuzzy c-means, is proposed judiciously integrating the merits of rough sets and fuzzy sets. While the concept of lower and upper approximations of rough sets deals with uncertainty, vagueness, and incompleteness in cluster definition, the integration of probabilistic and possibilistic memberships of fuzzy sets enables efficient handling of overlapping partitions in noisy environment. The concept of possibilistic lower bound and probabilistic boundary of a cluster, introduced in robust rough-fuzzy c-means, enables efficient selection of gene clusters. An efficient method is proposed to select initial prototypes of different gene clusters, which enables the proposed c-means algorithm to converge to an optimum or near optimum solutions and helps to discover coexpressed gene clusters. The effectiveness of the algorithm, along with a comparison with other algorithms, is demonstrated both qualitatively and quantitatively on 14 yeast microarray data sets.
Keywords :
bioinformatics; fuzzy set theory; genetic algorithms; genetics; genomics; lab-on-a-chip; microorganisms; pattern clustering; probability; biological process; cluster possibilistic lower bound; cluster probabilistic boundary; coexpressed gene cluster selection; functional genomics; gene clustering algorithm; gene expression data clustering; rough-fuzzy c-means algorithm; rough-fuzzy clustering; yeast microarray data set; Approximation methods; Clustering algorithms; Gene expression; Indexes; Probabilistic logic; Prototypes; Robustness; Approximation methods; Clustering algorithms; Gene expression; Indexes; Microarray; Probabilistic logic; Prototypes; Robustness; fuzzy sets; gene clustering; overlapping clustering; rough sets; Algorithms; Cluster Analysis; Computational Biology; Databases, Genetic; Fuzzy Logic; Gene Expression Profiling; Gene Regulatory Networks; Genes, Fungal; Models, Genetic; Oligonucleotide Array Sequence Analysis; Yeasts;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2012.103
Filename :
6256659
Link To Document :
بازگشت