• DocumentCode
    3321989
  • Title

    Enhanced biclustering on expression data

  • Author

    Yang, Jiong ; Wang, Haixun ; Wang, Wei ; Yu, Philip

  • fYear
    2003
  • fDate
    10-12 March 2003
  • Firstpage
    321
  • Lastpage
    327
  • Abstract
    Microarrays are one of the latest breakthroughs in experimental molecular biology, which provide a powerful tool by which the expression patterns of thousands of genes can be monitored simultaneously and are already producing huge amount of valuable data. The concept of bicluster was introduced by Cheng and Church (2000) to capture the coherence of a subset of genes and a subset of conditions. A set of heuristic algorithms were also designed to either find one bicluster or a set of biclusters, which consist of iterations of masking values and discovered biclusters, coarse and fine node deletion, node addition, and the inclusion of inverted data. These heuristics inevitably suffer from some serious drawback. The masking of values and discovered biclusters with random numbers may result in the phenomenon of random interference which in turn impacts the discovery of high quality biclusters. To address this issue and to further accelerate the biclustering process, we generalize the model of bicluster to incorporate values and propose a probabilistic algorithm (FLOC) that can discover a set of k possibly overlapping biclusters simultaneously. Furthermore, this algorithm can easily be extended to support additional features that suit different requirements at virtually little cost. Experimental study on the yeast gene expression data shows that the FLOC algorithm can offer substantial improvements over the previously proposed algorithm.
  • Keywords
    arrays; biological techniques; genetics; iterative methods; microorganisms; molecular biophysics; physiological models; probability; enhanced biclustering; experimental molecular biology; expression data; genes subset; heuristic algorithms set; iterations; microarrays; values; Acceleration; Algorithm design and analysis; Coherence; Costs; Data analysis; Fluctuations; Fungi; Gene expression; Heuristic algorithms; Interference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2003. Proceedings. Third IEEE Symposium on
  • Print_ISBN
    0-7695-1907-5
  • Type

    conf

  • DOI
    10.1109/BIBE.2003.1188969
  • Filename
    1188969