• DocumentCode
    2370345
  • Title

    Clustering microarray time-series data using expectation maximization and multiple profile alignment

  • Author

    Subhani, Numanul ; Rueda, Luis ; Ngom, Alioune ; Burden, Conrad J.

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Windsor, Windsor, ON, Canada
  • fYear
    2009
  • fDate
    1-4 Nov. 2009
  • Firstpage
    2
  • Lastpage
    7
  • Abstract
    A common problem in biology is to partition a set of experimental data into clusters in such a way that the data points within the same cluster are highly similar while data points in different clusters are very different. In this direction, clustering microarray time-series data via pairwise alignment of piece-wise linear profiles has been recently introduced. We propose a EM clustering approach based on a multiple alignment of natural cubic spline representations of gene expression profiles. The multiple alignment is achieved by minimizing the sum of integrated squared errors over a time-interval, defined on a set of profiles. Preliminary experiments on a well-known data set of 221 pre-clustered Saccharomyces cerevisiae gene expression profiles yield encouraging results with 83.26% accuracy.
  • Keywords
    biology computing; expectation-maximisation algorithm; pattern clustering; splines (mathematics); time series; EM clustering approach; Saccharomyces cerevisiae gene expression profiles; cubic spline representations; data points; expectation maximization; gene expression profiles; microarray time-series data cluster; multiple profile alignment; piece-wise linear profiles; Australia; Bayesian methods; Bioinformatics; Biology; Computer science; Gene expression; Genomics; Piecewise linear techniques; Poles and towers; Spline; Clustering; Cubic Spline; Gene Expression Profiles; Microarrays; Profile Alignment; Time-Series Data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine Workshop, 2009. BIBMW 2009. IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-5121-0
  • Type

    conf

  • DOI
    10.1109/BIBMW.2009.5332128
  • Filename
    5332128