DocumentCode
2377882
Title
Prediction of protein structural class using a combined representation of protein-sequence information and support vector machine
Author
Wu, Li ; Dai, Qi ; Han, Bin ; Zhu, Lei ; Li, Lihua
Author_Institution
Coll. of Life Inf. Sci. & Instrum. Eng., Hangzhou Dianzi Univ., Hangzhou, China
fYear
2010
fDate
18-18 Dec. 2010
Firstpage
101
Lastpage
106
Abstract
Knowledge of structural classes is useful in understanding of folding patterns in proteins. Although numerous methods were proposed and achieved promising results in structural class prediction, some problems in using protein-sequence information have impeded the development. In this paper, a combined representation of protein-sequence information is proposed for prediction of protein structural class, which combines word frequencies, word position information and physicochemical properties of amino acids. Then the support vector machine classifier is adopted to classify attributes of protein. To check the validity, we use three benchmark datasets and jackknife cross-validation to evaluate the proposed method. Results show that the proposed combined representation of protein-sequence information is more efficient, which indicates that the necessity for protein structural class prediction method to extract more information as possible.
Keywords
bioinformatics; molecular biophysics; molecular configurations; pattern classification; proteins; support vector machines; SVM classifier; amino acid physicochemical properties; protein folding patterns; protein structural class prediction; protein-sequence information representation; support vector machine; word frequencies; word position information; Physicochemical properties of amino acids; Protein structural class; Support vector machine; Word frequencies; Word positional information;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine Workshops (BIBMW), 2010 IEEE International Conference on
Conference_Location
Hong, Kong
Print_ISBN
978-1-4244-8303-7
Electronic_ISBN
978-1-4244-8304-4
Type
conf
DOI
10.1109/BIBMW.2010.5703781
Filename
5703781
Link To Document