• DocumentCode
    2591893
  • Title

    Using gnome wide data for protein function prediction by exploiting gene ontology relationships

  • Author

    Benso, Alfredo ; Carlo, Stefano Di ; urRehman, Hafeez ; Politano, Gianfranco ; Savino, Alessandro

  • Author_Institution
    Dept. of Control & Comput. Eng, Politec. di Torino, Torino, Italy
  • fYear
    2012
  • fDate
    24-27 May 2012
  • Firstpage
    497
  • Lastpage
    502
  • Abstract
    Many new therapeutic techniques depend not only on the knowledge of the molecules participating in the biological phenomena but also their biochemical function. Advancements in prediction of new proteins are immense if compared with the annotation of functionally unknown proteins. To accelerate the personalized medicine effort, computational techniques should be used in a smart way to accurately predict protein function. In this paper, we propose and evaluate a technique that utilizes integrated biological data from different online databases. We use this information along-with Gene Ontology (GO) relationships of functional annotations in a wide-ranging way to accurately predict protein function. We integrate PPI (Protein Protein Interactions) data, protein motifs information, and protein homology data, with a semantic similarity measure based on Gene Ontology to infer functional information for unannotated proteins. Our method is applied to predict function of a subset of homo sapiens species proteins. The integrated approach with GO relationships provides substantial improvement in precision and accuracy when compared to functional links without GO relationships. We provide a comprehensive assignment of annotated GO terms to many proteins that currently are not assigned any function.
  • Keywords
    biology computing; data handling; genetics; molecular biophysics; ontologies (artificial intelligence); proteins; GO relationships; PPI data; biochemical function; biological phenomena; computational techniques; functional annotations; gene ontology relationships; homo sapiens species proteins; integrated biological data; online databases; protein function prediction; protein homology data; protein motif information; protein unannotation; protein-protein interactions; semantic similarity measure; therapeutic techniques; Context; Databases; Proteins; Function Prediction; Gene Ontology; Protein Protein Interactions; Protein motifs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automation Quality and Testing Robotics (AQTR), 2012 IEEE International Conference on
  • Conference_Location
    Cluj-Napoca
  • Print_ISBN
    978-1-4673-0701-7
  • Type

    conf

  • DOI
    10.1109/AQTR.2012.6237762
  • Filename
    6237762