• DocumentCode
    1991297
  • Title
    BALBOA: Extending Bicluster Analysis to Classify ORFs using Expression Data
  • Author
    Bryan, Kenneth; Cunningham, Pádraig
  • Author_Institution
    Univ. Coll. Dublin, Dublin
  • fYear
    2007
  • fDate
    14-17 Oct. 2007
  • Firstpage
    995
  • Lastpage
    1002
  • Abstract
    Microarrays have the capacity to measure the expression of thousands of genes in parallel over many experimental samples. The unsupervised technique of bicluster analysis has been employed previously to uncover gene expression correlations over subsets of samples, with the aim of modelling the natural gene functional classes. However, the bicluster model also has the potential to shed light on the functions of unannotated open reading frames (ORFs), an aspect of biclustering that has been under-explored. In this work we illustrate how the bicluster representation of expression data may be extended to enable putative functional classification of unannotated ORFs. We develop an ORF annotation approach, referred to as BALBOA, in which classifiers are constructed from the class-specific expression patterns discovered by bicluster analysis. We demonstrate the efficacy of this approach via cross-validation and carry out a comparative evaluation with kNN classification across three yeast expression datasets. Finally, we assign putative functions to unannotated ORFs and attempt to corroborate the best-supported annotations with external experimental and protein sequence information.
  • Keywords
    biology computing; data analysis; genetics; microorganisms; molecular biophysics; pattern classification; pattern clustering; proteins; BALBOA; bicluster analysis; functional classification; gene expression; microarrays; natural gene functional classes; protein sequence information; unannotated open reading frames; yeast expression datasets; Adaptive systems; Computer science; Data analysis; Educational institutions; Fungi; Gene expression; Informatics; Pattern analysis; Performance analysis; Protein sequence;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
  • Conference_Location
    Boston, MA
  • Print_ISBN
    978-1-4244-1509-0
  • Type
    conf
  • DOI
    10.1109/BIBE.2007.4375679
  • Filename
    4375679