DocumentCode
3374704
Title
Identification of GPI-(like)-Anchored Proteins by Using SVM
Author
Cao, Wei ; Shimizu, Kentaro
Author_Institution
Dept. of Biotechnol., Tokyo Univ.
Volume
2
fYear
2006
fDate
20-24 June 2006
Firstpage
711
Lastpage
715
Abstract
A new simple method for identification of GPI-(like)-anchored proteins at sequence level is proposed in this paper. As a binary classifier of GPI-(like)-anchored proteins and non GPI-(like)-anchored proteins, a supervised machine learning algorithm, support vector machine (SVM) and simple representation of C-terminus of protein primary sequences with mean hydrophobicity were used. Not merely does the classifier show high accuracy of 96.00% under 5-fold cross validation test, but also the A UC as a good summary of the performance of the classifier reaches to 0.97. In virtue of being based on SVM, computational efficiency and remarkable generalization ability of our classifier would be helpful for protein annotation in whole genomic-wide
Keywords
biology computing; learning (artificial intelligence); molecular biophysics; pattern classification; proteins; sequences; support vector machines; C-terminus representation; GPI-(like)-anchored protein identification; SVM; binary classifier; mean hydrophobicity; protein primary sequence; sequence; supervised machine learning algorithm; support vector machine; Bioinformatics; Biotechnology; Computational efficiency; Genomics; Machine learning algorithms; Optimization methods; Protein sequence; Support vector machine classification; Support vector machines; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Computational Sciences, 2006. IMSCCS '06. First International Multi-Symposiums on
Conference_Location
Hanzhou, Zhejiang
Print_ISBN
0-7695-2581-4
Type
conf
DOI
10.1109/IMSCCS.2006.232
Filename
4673790
Link To Document