  • DocumentCode
    2341237
  • Title
    Protein-based analysis of alternative splicing in the human genome
  • Author
    Loraine, Ann E.; Helt, Gregg A.; Cline, Melissa S.; Siani-Rose, Michael A.
  • Author_Institution
    Affymetrix Inc., Emeryville, CA, USA
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    118
  • Lastpage
    124
  • Abstract
    Understanding the functional significance of alternative splicing and other mechanisms that generate RNA transcript diversity is an important challenge facing modern-day molecular biology. Using homology-based, protein sequence analysis methods, it should be possible to investigate how transcript diversity impacts protein structure and function. To test this, a data mining technique ("DiffHit") was developed to identify and catalog genes producing protein isoforms which exhibit distinct profiles of conserved protein motifs. We found that out of a test set of over 1,300 alternatively spliced genes with solved genomic structure, over 30% exhibited a differential profile of conserved InterPro and/or Blocks protein motifs across distinct isoforms. These results suggest that motif databases such as Blocks and InterPro are potentially useful tools for investigating how alternative transcript structure affects gene function.
  • Keywords
    biology computing; data mining; genetics; proteins; DiffHit; RNA transcript diversity; alternative splicing; conserved Blocks protein motifs; conserved InterPro protein motifs; conserved protein motifs; differential profile; gene cataloguing; gene identification; homology-based protein sequence analysis; human genome; molecular biology; motif databases; protein function; protein isoforms; protein structure; protein-based analysis; transcript structure; Bioinformatics; Data mining; Databases; Genomics; Humans; Protein engineering; Protein sequence; RNA; Splicing; Testing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Proceedings of the IEEE Computer Society Bioinformatics Conference, 2002
  • Print_ISBN
    0-7695-1653-X
  • Type
    conf
  • DOI
    10.1109/CSB.2002.1039335
  • Filename
    1039335
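
The abstract describes DiffHit only at a high level: it catalogs genes whose protein isoforms show distinct profiles of conserved motifs. The core comparison might be sketched roughly as below. This is a minimal illustration, not the authors' implementation; the function name, the input layout (gene to isoform to set of motif identifiers), and the placeholder gene and motif names are all assumptions made for the example.

```python
from typing import Dict, List, Set

# Hypothetical input layout: gene -> isoform -> set of motif identifiers
# (for example, identifiers of motif hits reported for each isoform).
MotifHits = Dict[str, Dict[str, Set[str]]]


def genes_with_differential_motifs(hits: MotifHits) -> List[str]:
    """Return genes whose isoforms do not all share the same motif set."""
    flagged = []
    for gene, isoforms in hits.items():
        # Collapse each isoform's motif set into a hashable profile;
        # more than one distinct profile means the isoforms differ.
        profiles = {frozenset(motifs) for motifs in isoforms.values()}
        if len(profiles) > 1:
            flagged.append(gene)
    return sorted(flagged)


# Placeholder data for illustration only; these are not real genes or motifs.
toy_hits: MotifHits = {
    "gene1": {"iso_a": {"motif_X", "motif_Y"}, "iso_b": {"motif_X"}},
    "gene2": {"iso_a": {"motif_Z"}, "iso_b": {"motif_Z"}},
    "gene3": {"iso_a": {"motif_X"}},
}

result = genes_with_differential_motifs(toy_hits)
print(result)
```

In this toy input only gene1 is flagged, because one of its isoforms lacks a motif that the other carries; gene2 has identical profiles and gene3 has a single isoform.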