Title :
A rough set based novel biclustering algorithm for gene expression data
Author :
Emilyn, J. Jeba ; Ramar, K.
Author_Institution :
Dept. of IT, Sona Coll. of Technol., Salem, India
Abstract :
Microarray technology makes it possible to monitor the expression levels of thousands of genes simultaneously across collections of related samples. The main goal in analyzing large, heterogeneous gene expression datasets is to identify groups of genes that are expressed under a set of experimental conditions. Several clustering techniques have been proposed for identifying gene signatures and understanding their role, and many of them have been applied to gene expression data, but with only partial success. This paper proposes a novel biclustering technique (RBGED) based on rough set theory. The algorithm simultaneously clusters both the rows and columns of a data matrix. Its advantage is that it removes the restriction that an object may belong to only one cluster. The algorithm also determines the optimum number of clusters automatically. The proposed algorithm is analyzed theoretically and examined in a case study with the Rough Fuzzy k-means algorithm.
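The abstract does not give the details of RBGED, so the following is only a minimal sketch of the general rough-set idea it relies on: an object may sit in the boundary region of several clusters instead of being forced into exactly one. The function name `rough_assign`, the ratio `threshold`, and the use of Euclidean distance are all assumptions made for illustration, following a ratio-threshold rule of the kind used in rough k-means variants; none of them is taken from the paper.

```python
import math


def rough_assign(point, centroids, threshold=1.3):
    """Return (lower, upper) sets of cluster indices for one object.

    Hypothetical helper, not the RBGED procedure: an object joins the
    lower approximation of its nearest cluster only when no other
    centroid is nearly as close; otherwise it joins the upper
    approximations of all close clusters (the boundary region).
    """
    dists = [math.dist(point, c) for c in centroids]
    nearest = min(range(len(dists)), key=dists.__getitem__)
    d_min = dists[nearest]

    # Other clusters whose distance is within `threshold` times the minimum.
    close = [k for k, d in enumerate(dists)
             if k != nearest and d <= threshold * d_min]

    if close:
        # Ambiguous object: no lower approximation, several upper ones.
        return set(), {nearest, *close}
    # Unambiguous object: lower approximation implies upper approximation.
    return {nearest}, {nearest}


if __name__ == "__main__":
    centroids = [(0.0, 0.0), (10.0, 0.0)]
    for p in [(1.0, 0.0), (4.5, 0.0), (9.0, 0.0)]:
        print(p, rough_assign(p, centroids))
```

In a biclustering setting the same kind of rule could in principle be applied to rows (genes) and columns (conditions) alike, but how RBGED actually does this is not stated in the abstract.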
Keywords :
biology computing; fuzzy set theory; genetics; pattern clustering; rough set theory; biclustering algorithm; data matrix; gene expression data; gene signature identification; microarray technology; rough fuzzy k means algorithm; rough set theory; Algorithm design and analysis; Approximation algorithms; Approximation methods; Clustering algorithms; Gene expression; Rough sets; Bicluster Algorithm; Distance measure; Gene Expression data; K-means; Microarray; Rough sets;
Conference_Title :
Electronics Computer Technology (ICECT), 2011 3rd International Conference on
Conference_Location :
Kanyakumari
Print_ISBN :
978-1-4244-8678-6
Electronic_ISBN :
978-1-4244-8679-3
DOI :
10.1109/ICECTECH.2011.5941702