• DocumentCode
    478732
  • Title

    The Relationship between Protein Sequences and their Gene Ontology Functions

  • Author

    Duan, Zhong-Hui ; Hughes, Brent ; Reichel, Lothar ; Shi, Ting

  • Author_Institution
    Dept. of Comput. Sci., Akron Univ., OH
  • Volume
    1
  • fYear
    2006
  • fDate
    20-24 June 2006
  • Firstpage
    76
  • Lastpage
    83
  • Abstract
    The underlying assumption of many automated sequence annotation methods is that similar sequences imply similar biological functions. The present paper re-examines this assumption. A novel measure based on a set of local BLAST alignments is introduced to define the overall similarity between two protein sequences. The relationships between yeast protein sequences and their biological functions in the context of gene ontology categories are presented, and the effects of the level of gene ontology terms and the size of gene ontology groups on the degree of similarity are studied. The similarity distributions at different levels of gene ontology trees are considered. To evaluate the theoretical prediction power of similar sequences, we compute the posterior probability of correct predictions. The results indicate that the posterior probability can serve as an important measure for automated protein function prediction.
  • Keywords
    biology computing; genetics; ontologies (artificial intelligence); proteins; sequences; statistical distributions; BLAST alignment; automated protein function prediction; automated sequence annotation method; biological function; gene ontology function; probability distribution; protein sequence; Bioinformatics; Biological information theory; Biological processes; Biology; Databases; Fungi; Genomics; Ontologies; Organisms; Proteins
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer and Computational Sciences, 2006. IMSCCS '06. First International Multi-Symposiums on
  • Conference_Location
    Hangzhou, Zhejiang
  • Print_ISBN
    0-7695-2581-4
  • Type

    conf

  • DOI
    10.1109/IMSCCS.2006.133
  • Filename
    4673528