• DocumentCode
    873044
  • Title

    Combinatorial Analysis for Sequence and Spatial Motif Discovery in Short Sequence Fragments

  • Author

    Jackups, Ronald, Jr. ; Liang, Jie

  • Author_Institution
    Dept. of Pathology & Immunology, Washington Univ. Sch. of Med., St. Louis, MO, USA
  • Volume
    7
  • Issue
    3
  • fYear
    2010
  • Firstpage
    524
  • Lastpage
    536
  • Abstract
    Motifs are overrepresented sequence or spatial patterns appearing in proteins. They often play important roles in maintaining protein stability and in facilitating protein function. When motifs are located in short sequence fragments, as in transmembrane domains that are only 6-20 residues in length, and when there is only very limited data, it is difficult to identify motifs. In this study, we introduce combinatorial models based on permutation for assessing statistically significant sequence and spatial patterns in short sequences. We show that our method can uncover previously unknown sequence and spatial motifs in β-beta-barrel membrane proteins and that our method outperforms existing methods in detecting statistically significant motifs in this data set. Last, we discuss implications of motif analysis for problems involving short sequences in other families of proteins.
  • Keywords
    biology computing; biomembranes; molecular biophysics; proteins; β-barrel membrane proteins; combinatorial analysis; protein stability; short sequence fragments; spatial motif discovery; Assembly; Bioinformatics; Biomembranes; Books; Frequency; Helium; Proteins; Sequences; Stability; Statistical distributions; Combinatorial algorithms; Motifs; Permutations and combinations; combinatorial models; sequence analysis.; short sequence; Algorithms; Amino Acid Motifs; Amino Acid Sequence; Models, Molecular; Protein Folding; Protein Structure, Secondary; Proteins; Sequence Alignment; Sequence Analysis, Protein;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2008.101
  • Filename
    4633345