• DocumentCode
    2711300
  • Title

    Biclustering Expression Data Using Node Addition Algorithm

  • Author

    Borah, B. ; Bhattacharyya, Dhruba Kumar

  • Author_Institution
    Tezpur Univ. Napam, Tezpur
  • fYear
    2007
  • fDate
    18-21 Dec. 2007
  • Firstpage
    307
  • Lastpage
    313
  • Abstract
    Biclustering algorithms simultaneously cluster both rows and columns. This type of algorithms are applied to gene expression data analysis to find a subset of genes that exhibit similar expression pattern under a subset of conditions. Cheng and Church introduced the mean squared residue measure to capture the coherence of a subset of genes over a subset of conditions. They provided a set of heuristic algorithms based primarily on node deletion to find one bicluster or a set of biclusters after masking discovered biclusters with random values. Masking of discovered biclusters with random values interferes with discovery of high quality biclusters. We provide an efficient node addition algorithm to find a set of biclusters without the need of masking discovered biclusters. Initialized with a gene and a subset of conditions, a bicluster is extended by adding more genes and conditions. Thus it provides facility to study individual genes, besides generating a large number of biclusters with different initializations. Biclusters with lower or higher scores within a specified limit can be generated by parameter setting. Use of incremental method of computing score makes the algorithm faster.
  • Keywords
    biology computing; data analysis; genetics; mean square error methods; pattern clustering; biclustering algorithms; biclustering expression data; gene expression data analysis; heuristic algorithms; mean squared residue measure; node addition algorithm; node deletion; Biology computing; Clustering algorithms; Coherence; Computer science; DNA; Data analysis; Data engineering; Fluctuations; Gene expression; Heuristic algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computing and Communications, 2007. ADCOM 2007. International Conference on
  • Conference_Location
    Guwahati, Assam
  • Print_ISBN
    0-7695-3059-1
  • Type

    conf

  • DOI
    10.1109/ADCOM.2007.122
  • Filename
    4425989