DocumentCode :
2723246
Title :
Discovering Connected Patterns in Gene Expression Arrays
Author :
Yousri, Noha A. ; Ismail, Mohamed A. ; Kamel, Mohamed S.
Author_Institution :
IEEE Comput. & Syst. Eng., Alexandria Univ. of Alexandria
fYear :
2007
fDate :
1-5 April 2007
Firstpage :
113
Lastpage :
120
Abstract :
Clustering methods have been extensively used for gene expression data analysis to detect groups of related genes. The clusters provide useful information to analyze gene function, gene regulation and cellular patterns. Most existing clustering algorithms, though, discover only coherent gene expression patterns, and do not handle connected patterns. Coherent and connected patterns correspond to globular and arbitrary shaped clusters, respectively, in low dimensional spaces. For high dimensional gene expression data, two connected patterns can be two similar patterns with time lags in a time series data, or in general, two different patterns that are connected by an intermediate pattern that is related to both of them. Discovering such connected patterns has important biological implications not revealed by groups of coherent patterns. In this paper, a novel algorithm that finds connected patterns, in gene expression data, is proposed. Using a novel merge criterion, it can distinguish clusters based on distances between patterns, thus avoiding the effect of noise and outliers. Moreover, the algorithm uses a metric based on Pearson correlation to find neighbours, which renders it a lower complexity than related algorithms. Both time series and non temporal gene expression data sets are used to illustrate the efficiency of the proposed algorithm. Results on the serum and the leukaemia data sets reveal interesting biologically significant information
Keywords :
biology computing; data analysis; data mining; genetics; pattern clustering; cellular patterns; clustering method; connected pattern discovery; gene expression arrays; gene expression data analysis; gene function; gene regulation; Algorithm design and analysis; Bioinformatics; Biology computing; Clustering algorithms; Clustering methods; Computational biology; Computational intelligence; Couplings; Gene expression; Pattern analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0710-9
Type :
conf
DOI :
10.1109/CIBCB.2007.4221212
Filename :
4221212
Link To Document :
بازگشت