Title :
Clustering microarray time-series data using expectation maximization and multiple profile alignment
Author :
Subhani, Numanul ; Rueda, Luis ; Ngom, Alioune ; Burden, Conrad J.
Author_Institution :
Sch. of Comput. Sci., Univ. of Windsor, Windsor, ON, Canada
Abstract :
A common problem in biology is to partition a set of experimental data into clusters in such a way that the data points within the same cluster are highly similar while data points in different clusters are very different. In this direction, clustering microarray time-series data via pairwise alignment of piece-wise linear profiles has been recently introduced. We propose a EM clustering approach based on a multiple alignment of natural cubic spline representations of gene expression profiles. The multiple alignment is achieved by minimizing the sum of integrated squared errors over a time-interval, defined on a set of profiles. Preliminary experiments on a well-known data set of 221 pre-clustered Saccharomyces cerevisiae gene expression profiles yield encouraging results with 83.26% accuracy.
Keywords :
biology computing; expectation-maximisation algorithm; pattern clustering; splines (mathematics); time series; EM clustering approach; Saccharomyces cerevisiae gene expression profiles; cubic spline representations; data points; expectation maximization; gene expression profiles; microarray time-series data cluster; multiple profile alignment; piece-wise linear profiles; Australia; Bayesian methods; Bioinformatics; Biology; Computer science; Gene expression; Genomics; Piecewise linear techniques; Poles and towers; Spline; Clustering; Cubic Spline; Gene Expression Profiles; Microarrays; Profile Alignment; Time-Series Data;
Conference_Titel :
Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4244-5121-0
DOI :
10.1109/BIBMW.2009.5332128