Title :
Prediction of protein structural class using a combined representation of protein-sequence information and support vector machine
Author :
Wu, Li ; Dai, Qi ; Han, Bin ; Zhu, Lei ; Li, Lihua
Author_Institution :
Coll. of Life Inf. Sci. & Instrum. Eng., Hangzhou Dianzi Univ., Hangzhou, China
Abstract :
Knowledge of structural classes is useful in understanding of folding patterns in proteins. Although numerous methods were proposed and achieved promising results in structural class prediction, some problems in using protein-sequence information have impeded the development. In this paper, a combined representation of protein-sequence information is proposed for prediction of protein structural class, which combines word frequencies, word position information and physicochemical properties of amino acids. Then the support vector machine classifier is adopted to classify attributes of protein. To check the validity, we use three benchmark datasets and jackknife cross-validation to evaluate the proposed method. Results show that the proposed combined representation of protein-sequence information is more efficient, which indicates that the necessity for protein structural class prediction method to extract more information as possible.
Keywords :
bioinformatics; molecular biophysics; molecular configurations; pattern classification; proteins; support vector machines; SVM classifier; amino acid physicochemical properties; protein folding patterns; protein structural class prediction; protein-sequence information representation; support vector machine; word frequencies; word position information; Physicochemical properties of amino acids; Protein structural class; Support vector machine; Word frequencies; Word positional information;
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2010 IEEE International Conference on
Conference_Location :
Hong, Kong
Print_ISBN :
978-1-4244-8303-7
Electronic_ISBN :
978-1-4244-8304-4
DOI :
10.1109/BIBMW.2010.5703781