• DocumentCode
    3087238
  • Title

    Preference of Amino Acids in Different Protein Structural Classes: A Database Analysis

  • Author

    Ismail, Wazim Mohammed ; Chowdhury, Shibasish

  • Author_Institution
    Biol. Sci. Group, Birla Inst. of Technol. & Sci., Pilani, India
  • fYear
    2010
  • fDate
    18-20 June 2010
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Understanding sequence-structure relationship is the key step in protein modeling and de novo protein design. Although almost 55,000 protein structures are solved and stored in protein data bank, elucidating sequence-structure relationship is still a challenging task. To understand sequence-structure relationship better, a statistical analysis of amino acid residues in four major structural classes of protein viz. α proteins, β proteins, α/β proteins and α+β proteins is performed. We use non-homologous proteins from (<; 30% identity) October 2008 release Brookhaven Protein Data Bank (PDB) with resolution better than 2.5 angstrom. Interestingly, in comparison to the helical protein, the helical propensities of hydrophobic residues in mix proteins (containing both α helix and β sheet) are increased significantly. On the other hand, the helical propensities of hydrophilic residues are reduced in mixed proteins. A reverse trend is observed in strand propensity. The difference in helical propensity of hydrophobic and hydrophilic residues in different fold may be due to differential folding mechanism. The size of protein may also play a crucial role. A position specific analysis of helices is also done in all α and α/β proteins. The detailed analysis of helix dissection revealed that, the presence of β sheet influences the individual preference of amino acids in different positions within helix. This result indicates that the preference of amino acid in different positions (N terminus, C terminus and middle) within α helix are influenced by long range interactions with other structural elements.
  • Keywords
    biology computing; molecular configurations; proteins; statistical analysis; C terminus; N terminus; amino acid preference; amino acid residues; database analysis; differential folding mechanism; helical protein; helix dissection; hydrophilic residues; hydrophobic residues; nonhomologous proteins; protein data bank; protein structural classes; protein structures; sequence-structure relationship; statistical analysis; structural elements; Algorithm design and analysis; Amino acids; Biological system modeling; Biology; Coils; Data analysis; Databases; Prediction algorithms; Proteins; Statistical analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
  • Conference_Location
    Chengdu
  • ISSN
    2151-7614
  • Print_ISBN
    978-1-4244-4712-1
  • Electronic_ISBN
    2151-7614
  • Type

    conf

  • DOI
    10.1109/ICBBE.2010.5514826
  • Filename
    5514826