• DocumentCode
    3103541
  • Title

    Application of Function Domain and Pseudo Amino Acid Composition to Predict Hetero-Oligomer Protein Structural Types

  • Author

    Xiao, Xuan ; Wang, Pu

  • Author_Institution
    Sch. of Mech. & Electron. Eng., Jing-De-Zhen Ceramic Inst., Jing-De-Zhen, China
  • fYear
    2010
  • fDate
    18-20 June 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    With the avalanche of protein sequences generated in the post-genomic age, it is highly desirable to develop an automated method by which crystallographic scientists can rapidly and effectively identify which quaternary attribute a particular protein chain has according to its sequence information. Given most of the previous studies are limited to homo-oligomers, in this paper, we will try to identify the quaternary attribute of hetero-oligomer proteins. For a hetero-oligomer, its type will be identified among the following six categories: (1) heterodimer, (2) heterotrimer, (3) heterotetramer, (4) heteropentamer, (5) heterohexamer, (6) heterooctamer. Using machine learning approach, the Fuzzy Nearest Neighbor Algorithm (FKNN), we developed a prediction system for protein quaternary structural type in which we incorporated functional domain composition (FunD) and pseudo-amino acid composition (PseAA). The overall accuracy achieved by this system is more than 80% in the Jack-knife test. Such a technique should improve the success rate of structural biology projects.
  • Keywords
    biological techniques; biology computing; fuzzy logic; genomics; learning (artificial intelligence); molecular biophysics; molecular configurations; proteins; Jack-knife test; functional domain composition; fuzzy nearest neighbor algorithm; hetero-oligomer protein structural types; heterodimer; heterohexamer; heterooctamer; heteropentamer; heterotetramer; heterotrimer; machine learning approach; post-genomic age; protein chain; protein quaternary structural type; protein sequences; pseudo amino acid composition; structural biology projects; Amino acids; Ceramics; Crystallography; Fuzzy systems; In vivo; Machine learning; Machine learning algorithms; Nearest neighbor searches; Protein engineering; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
  • Conference_Location
    Chengdu
  • ISSN
    2151-7614
  • Print_ISBN
    978-1-4244-4712-1
  • Electronic_ISBN
    2151-7614
  • Type

    conf

  • DOI
    10.1109/ICBBE.2010.5515624
  • Filename
    5515624