• DocumentCode
    2069920
  • Title

    Gene classification using expression profiles: a feasibility study

  • Author

    Kuramochi, Michihiro ; Karypis, George

  • Author_Institution
    Dept. of Comput. Sci., Minnesota Univ., Minneapolis, MN, USA
  • fYear
    2001
  • fDate
    4-6 Nov 2001
  • Firstpage
    191
  • Lastpage
    200
  • Abstract
    As various genome sequencing projects have already been completed or are near completion, genome researchers are shifting their focus to functional genomics. Functional genomics represents the next phase, that expands the biological investigation to studying the functionality of genes of a single organism as well as studying and correlating the functionality of genes across many different organisms. Recently developed methods for monitoring genome-wide mRNA expression changes hold the promise of allowing us to inexpensively gain insights into the function of unknown genes. In this paper we focus on evaluating the feasibility of using supervised machine learning methods for determining the function of genes based solely on their expression profiles. We experimentally evaluate the performance of traditional classification algorithms such as support vector machines and k-nearest neighbors on the yeast genome, and present new approaches for classification that improve the overall recall with moderate reductions in precision. Our experiments show that the accuracies achieved for different classes varies dramatically. In analyzing these results we show that the achieved accuracy is highly dependent on whether or not the genes of that class were significantly active during the various experimental conditions, suggesting that gene expression profiles can become a viable alternative to sequence similarity searches provided that the genes are observed under a wide range of experimental conditions
  • Keywords
    biology computing; genetics; learning (artificial intelligence); learning automata; expression profiles; functional genomics; gene classification; genome sequencing projects; k-nearest neighbors; organism; sequence similarity searches; supervised machine learning methods; support vector machines; yeast genome; Bioinformatics; Classification algorithms; Fungi; Genomics; Learning systems; Machine learning algorithms; Monitoring; Organisms; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering Conference, 2001. Proceedings of the IEEE 2nd International Symposium on
  • Conference_Location
    Bethesda, MD
  • Print_ISBN
    0-7695-1423-5
  • Type

    conf

  • DOI
    10.1109/BIBE.2001.974429
  • Filename
    974429