• DocumentCode
    1013400
  • Title

    Human-Readable Rule Generator for Integrating Amino Acid Sequence Information and Stability of Mutant Proteins

  • Author

    Huang, Liang-Tsung ; Lai, Lien-Fu ; Gromiha, M. Michael

  • Author_Institution
    Dept. of Inf. Commun., Mingdao Univ., Changhua, Taiwan
  • Volume
    7
  • Issue
    4
  • fYear
    2010
  • Firstpage
    681
  • Lastpage
    687
  • Abstract
    Most of the bioinformatics tools developed for predicting mutant protein stability appear as a black box and the relationship between amino acid sequence/structure and stability is hidden to the users. We have addressed this problem and developed a human-readable rule generator for integrating the knowledge of amino acid sequence and experimental stability change upon single mutation. Using information about the original residue, substituted residue, and three neighboring residues, classification rules have been generated to discriminate the stabilizing and destabilizing mutants and explore the basis for experimental data. These rules are human readable, and hence, the method enhances the synergy between expert knowledge and computational system. Furthermore, the performance of the rules has been assessed on a nonredundant data set of 1,859 mutants and we obtained an accuracy of 80 percent using cross validation. The results showed that the method could be effectively used as a tool for both knowledge discovery and predicting mutant protein stability. We have developed a Web for classification rule generator and it is freely available at http://bioinformatics.myweb.hinet.net/irobot.htm.
  • Keywords
    bioinformatics; knowledge acquisition; molecular biophysics; proteins; Web site; amino acid sequence information integration; bioinformatics tools; human-readable rule generator; knowledge discovery; mutant protein stability; nonredundant data set; Amino acids; Bioinformatics; Computational biology; Data mining; Decision trees; Genetic mutations; Humans; Protein engineering; Protein sequence; Stability analysis; Protein stability; classification rule; data mining.; prediction; protein stability; rule generator; Amino Acid Sequence; Amino Acids; Comprehension; Computational Biology; Databases, Protein; Mutant Proteins; Mutation; Protein Stability; Sequence Analysis, Protein;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2008.128
  • Filename
    4693706