Title :
DPBC: Distance Based Possibilistic Biclustering with Application to Gene Expression Analysis
Author :
Mahfouz, Mohamed A. ; Ismail, Mohamed A.
Author_Institution :
Dept. of Comput. & Syst. Eng., Alexandria Univ., Alexandria, Egypt
Abstract :
Biclustering is a key step in analyzing gene expression data by identifying patterns where subset of genes are co-related based on a subset of conditions. This paper proposes a new distance based possibilistic biclustering algorithm (DPBC), in which the average distances between rows and between columns of the bicluster are minimized and at the same time the size of the bicluster is maximized by computing the zeros of the derivative of appropriate objective function. The proposed algorithm uses the possibilistic clustering paradigm similar to another existing possibilistic biclustering algorithm PBC. Whereas PBC is based on residue our approach is applicable to any accepted definition for distances between pairs of rows or columns. Experimental study on the human dataset and several artificial datasets having different noise levels shows that the DPBC algorithm can offer substantial improvements over the previously proposed algorithms.
Keywords :
data mining; fuzzy set theory; genetics; pattern clustering; possibility theory; statistical analysis; artificial datasets; distance based possibilistic biclustering algorithm; fuzzy set theory; gene expression; human dataset; noise levels; objective function; Application software; Clustering algorithms; Data analysis; Data engineering; Gene expression; Humans; Iterative algorithms; Noise level; Pattern analysis; Systems engineering and theory;
Conference_Titel :
Bioinformatics and Biomedical Engineering , 2009. ICBBE 2009. 3rd International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-2901-1
Electronic_ISBN :
978-1-4244-2902-8
DOI :
10.1109/ICBBE.2009.5162662