• DocumentCode
    2767170
  • Title

    Function prediction for in silico protein mutagenesis using transduction and active learning

  • Author

    Basit, Nada ; Wechsler, Harry

  • Author_Institution
    Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
  • fYear
    2011
  • fDate
    12-15 Nov. 2011
  • Firstpage
    939
  • Lastpage
    940
  • Abstract
    As wet lab mutagenesis is expensive and time consuming one would use instead in silico mutagenesis. Towards that end, this paper proposes a novel method, T-RF-AL, for in silico mutagenesis. The method combines the merits of transduction, random forests, and active learning, the latter driven by a criterion of maximum curiosity. The feasibility of the T-RF-AL is shown on predicting mutant activity for HIV-1 Protease (HIV-1) and Bacteriophage T4 Lysozyme (T4) datasets. The new method, incremental in nature, compares favorably against random forests using the same active learning criteria, that of maximum curiosity. The observed advantages include better prediction accuracy that is achieved faster and using less training data.
  • Keywords
    biochemistry; biology computing; cellular biophysics; enzymes; genetics; learning (artificial intelligence); molecular biophysics; HIV-1 protease; active learning; bacteriophage T4 lysozyme datasets; function prediction; in silico protein mutagenesis; mutant activity; random forests; transduction; wet lab mutagenesis; Accuracy; Amino acids; Bioinformatics; Learning systems; Protein engineering; Proteins; active learning; enzyme mutant activity; incremental learning; maximum curiosity; protein function prediction; random forests; transduction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4577-1612-6
  • Type

    conf

  • DOI
    10.1109/BIBMW.2011.6112511
  • Filename
    6112511