• DocumentCode
    2516576
  • Title

    Prediction of Protein Quaternary Structural Type with Functional Domain and Pseudo Amino Acid Composition

  • Author

    Xiao, Xuan ; Wang, Pu

  • Author_Institution
    Sch. of Mech. & Electron. Eng., Jing-De-Zhen Ceramic Inst., Jing-De-Zhen, China
  • fYear
    2009
  • fDate
    11-13 June 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In the protein universe, many proteins are composed of two or more polypeptide chains, generally referred to as subunits, which associate through noncovalent interactions and, occasionally, disulfide bonds. With the number of protein sequences entering into data banks rapidly increasing, we are confronted with a challenge: how to develop an automated method to identify the quaternary attribute for a new polypeptide chain (i.e., whether it is formed just as a monomer, or as a dimer, trimer, or any other oligomer). This is important, because the functions of proteins are closely related to their quaternary attribute. In this report, using machine learning approach, the nearest neighbor algorithm (NNA) and covariant-discriminant algorithm (CDA), we developed a prediction system for protein quaternary structural type in which we incorporated functional domain composition (FunD) and pseudo-amino acid composition (PseAA). To compare, we adopted a benchmark dataset, which had been studied time after time. The overall accuracy achieved by this system is more than 89% in the Jack-knife test. Such a technique should improve the success rate of structural biology projects.
  • Keywords
    bioinformatics; covariance analysis; learning (artificial intelligence); molecular biophysics; molecular configurations; pattern classification; proteins; statistical testing; Jack-knife test; automated method; covariant-discriminant algorithm; data banks; disulfide bonds; functional domain; functional domain composition; machine learning approach; nearest neighbor classifier algorithm; noncovalent interactions; polypeptide chain; polypeptide chains; protein quaternary structural type prediction; protein sequences; pseudo amino acid composition; Amino acids; Benchmark testing; Ceramics; In vivo; Machine learning; Machine learning algorithms; Nearest neighbor searches; Protein engineering; Sequences; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering , 2009. ICBBE 2009. 3rd International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-2901-1
  • Electronic_ISBN
    978-1-4244-2902-8
  • Type

    conf

  • DOI
    10.1109/ICBBE.2009.5163214
  • Filename
    5163214