DocumentCode :
1840427
Title :
Extracting Sequence Features to Predict DNA-Binding Proteins Using Support Vector Machine
Author :
Xin Ma ; Lefu Hu
Author_Institution :
Golden Audit Coll., Nanjing Audit Univ., Nanjing, China
fYear :
2013
fDate :
21-23 June 2013
Firstpage :
152
Lastpage :
155
Abstract :
DNA-binding proteins plays important role in a variety of vital biology processes. In this study, we apply a machine learning method for classify DNA-binding proteins from non-binding proteins based on sequence information. Using an evolutionary feature and residue composition feature extracted from primary structure, we have trained a support vector machine(SVM) to distinguish DNA-binding proteins from other proteins that do not binding DNA. The prediction performances are evaluated by independent test dataset which contains 361 DNA-binding proteins and 361 non-binding proteins. Our proposed method outperforms the other existing methods in the test dataset. The results achieved by our proposed method for accuracy, 84.16%; sensitivity, 85.47%; specificity, 82.89% and Matthews correlation coefficient(MCC), 0.5828 demonstrate its good performance.
Keywords :
DNA; biology computing; feature extraction; molecular biophysics; proteins; support vector machines; DNA-binding proteins; biology processes; evolutionary feature; sequence feature extraction; sequence information; support vector machine; Accuracy; Amino acids; DNA; Proteins; Sensitivity; Support vector machines; Training; DNA-binding proteins; Support vector machine; residues compositions; specific scoring matrice (PSSM);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational and Information Sciences (ICCIS), 2013 Fifth International Conference on
Conference_Location :
Shiyang
Type :
conf
DOI :
10.1109/ICCIS.2013.48
Filename :
6642963
Link To Document :
بازگشت