• DocumentCode
    269229
  • Title

    LateBiclustering: Efficient Heuristic Algorithm for Time-Lagged Bicluster Identification

  • Author

    Gonçalves, Joana P. ; Madeira, Sara C.

  • Author_Institution
    Centrum Wiskunde & Inf., Amsterdam, Netherlands
  • Volume
    11
  • Issue
    5
  • fYear
    2014
  • fDate
    Sept.-Oct. 2014
  • Firstpage
    801
  • Lastpage
    813
  • Abstract
    Identifying patterns in temporal data is key to uncover meaningful relationships in diverse domains, from stock trading to social interactions. Also of great interest are clinical and biological applications, namely monitoring patient response to treatment or characterizing activity at the molecular level. In biology, researchers seek to gain insight into gene functions and dynamics of biological processes, as well as potential perturbations of these leading to disease, through the study of patterns emerging from gene expression time series. Clustering can group genes exhibiting similar expression profiles, but focuses on global patterns denoting rather broad, unspecific responses. Biclustering reveals local patterns, which more naturally capture the intricate collaboration between biological players, particularly under a temporal setting. Despite the general biclustering formulation being NP-hard, considering specific properties of time series has led to efficient solutions for the discovery of temporally aligned patterns. Notably, the identification of biclusters with time-lagged patterns, suggestive of transcriptional cascades, remains a challenge due to the combinatorial explosion of delayed occurrences. Herein, we propose LateBiclustering, a sensible heuristic algorithm enabling a polynomial rather than exponential time solution for the problem. We show that it identifies meaningful time-lagged biclusters relevant to the response of Saccharomyces cerevisiae to heat stress.
  • Keywords
    biology computing; computational complexity; diseases; genetics; microorganisms; pattern clustering; string matching; time series; LateBiclustering; NP-hard; Saccharomyces cerevisiae; biological applications; biological players; biological processes; clinical applications; clustering; disease; diverse domain relationships; gene functions; heat stress; heuristic algorithm; molecular level; patient response monitoring; patient treatment; polynomial time solution; potential perturbations; social interactions; temporal data patterns; temporal setting; temporally aligned patterns; time series; time-lagged bicluster identification; transcriptional cascades; Bioinformatics; Computational biology; Gene expression; Pattern matching; Time series analysis; Time series; biclustering; local pattern; pattern matching; pattern recognition; string matching; time lag;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2312007
  • Filename
    6774461