• DocumentCode
    3153000
  • Title

    GOASVM: Protein subcellular localization prediction based on Gene ontology annotation and SVM

  • Author

    Wan, Shibiao ; Mak, Man-Wai ; Kung, Sun-Yuan

  • Author_Institution
    Dept. of Electron. & Inf. Eng., Hong Kong Polytech. Univ., Hong Kong, China
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    2229
  • Lastpage
    2232
  • Abstract
    Protein subcellular localization is an essential step to annotate proteins and to design drugs. This paper proposes a functional-domain based method-GOASVM-by making full use of Gene Ontology Annotation (GOA) database to predict the subcellular locations of proteins. GOASVM uses the accession number (AC) of a query protein and the accession numbers (ACs) of homologous proteins returned from PSI-BLAST as the query strings to search against the GOA database. The occurrences of a set of predefined GO terms are used to construct the GO vectors for classification by support vector machines (SVMs). The paper investigated two different approaches to constructing the GO vectors. Experimental results suggest that using the ACs of homologous proteins as the query strings can achieve an accuracy of 94.68%, which is significantly higher than all published results based on the same dataset. As a user-friendly web-server, GOASVM is freely accessible to the public at http://bioinfo.eie.polyu.edu.hk/mGoaSvmServer/GOASVM.html.
  • Keywords
    Internet; cellular biophysics; drugs; genetics; human computer interaction; molecular biophysics; ontologies (artificial intelligence); proteins; support vector machines; GO vectors; GOA database; GOASVM; PSI-BLAST; accession number; drugs; functional-domain based method; gene ontology annotation database; homologous proteins; protein subcellular localization prediction; query protein; query strings; support vector machines; user-friendly Web-server; Amino acids; Databases; Ontologies; Proteins; Support vector machines; Training; Vectors; GO terms; Gene Ontology; Gene Ontology Annotation; Protein subcellular localization; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288356
  • Filename
    6288356