DocumentCode :
2527415
Title :
Fractal clustering for microarray data analysis
Author :
Wang, Lu-yong ; Balasubramanian, Ammaiappan ; Chakraborty, Amit ; Comaniciu, Dorin
Author_Institution :
Integrated Data Syst. Dept., Siemens Corp. Res. Inc., Princeton, NJ, USA
fYear :
2005
fDate :
8-11 Aug. 2005
Firstpage :
97
Lastpage :
98
Abstract :
DNA microarray experiments generate a substantial amount of information about global gene expression. Gene expression profiles can be represented as points in multi-dimensional space. It is essential to identify relevant groups of genes in biomedical research. Clustering is helpful in pattern recognition in gene expression profiles. Some clustering techniques have been introduced. However, these traditional methods mainly utilize shape-based assumption or distance metric to cluster the points in multi-dimension linear Euclidean space. Poor consistence with the functional annotation of genes is shown in their validation study. A fractal clustering method to cluster genes using intrinsic (fractal) dimension from modern geometry is proposed. Fractal dimension is used to characterize the degree of self similarity among the points in the clusters. The main idea of fractal clustering is to group points in a cluster in such a way that none of the points in the cluster changes the cluster´s intrinsic dimension radically. Hausdorff fractal dimension is computed through the means of the box-counting plot algorithm, since it is the fastest and also robust enough. This method is assessed using validation assessment using public microarray dataset. It shows that this method is superior in identifying functional related gene groups than other traditional methods.
Keywords :
DNA; arrays; cellular biophysics; fractals; genetics; molecular biophysics; pattern clustering; DNA microarray; Hausdorff fractal dimension; biomedical research; box-counting plot algorithm; fractal clustering; functional gene annotation; gene expression; linear Euclidean space; microarray data analysis; multidimensional space; pattern recognition; shape-based assumption; Clustering algorithms; Clustering methods; DNA; Data analysis; Extraterrestrial measurements; Fractals; Gene expression; Geometry; Pattern recognition; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Systems Bioinformatics Conference, 2005. Workshops and Poster Abstracts. IEEE
Print_ISBN :
0-7695-2442-7
Type :
conf
DOI :
10.1109/CSBW.2005.66
Filename :
1540556
Link To Document :
بازگشت