DocumentCode :
2143310
Title :
RβHpred: Prediction of Right-Handed β-Helix Fold from Protein Sequence Using SVM and Protein Threading
Author :
Singh, Siddharth ; Hajela, Krishnan ; Ramani, Ashwini
Author_Institution :
Devi Ahilya Univ., Indore
fYear :
2007
fDate :
16-19 Oct. 2007
Firstpage :
1116
Lastpage :
1121
Abstract :
The right-handed single-stranded beta-helix proteins, characterized as virulence factors, allergens and toxins, are a threat to human health. Identification of these proteins from primary sequence is of great importance in biomedicine and medical microbiology. In this paper, a support vector machine (SVM) is used to predict the presence of the beta-helix fold in protein sequences using dipeptide composition. An input vector of 400 dimensions is used to search for the presence of conserved secondary structure elements, called rungs, in beta-helix proteins. A maximum accuracy of 90.1% and a Matthews correlation coefficient of 0.77 are obtained in a 5-fold cross-validation procedure. In addition, a position-specific scoring matrix (PSSM) is used to score putative rung sequences identified by the SVM. Finally, the predicted beta-helix proteins are threaded against a custom beta-helix template library to achieve high prediction confidence. The method recognizes right-handed beta-helices with 100% sensitivity and 99.8% specificity on a test set of known protein structures.
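Illustration :
The 400-dimensional input mentioned in the abstract corresponds to the 20 x 20 possible ordered amino-acid pairs. The following is a minimal sketch, not the authors' implementation, of how such a dipeptide composition vector and the Matthews correlation coefficient could be computed. The function names, the choice to skip pairs containing non-standard residues, and the normalization by the number of valid pairs are assumptions made for illustration only; the paper may use a different normalization.

```python
from itertools import product
from math import sqrt

# The 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# All 20 x 20 = 400 ordered dipeptides, in a fixed order.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]
INDEX = {dp: i for i, dp in enumerate(DIPEPTIDES)}


def dipeptide_composition(sequence):
    """Return a 400-dimensional vector of dipeptide fractions.

    Assumption: overlapping adjacent pairs are counted, pairs that
    contain a non-standard residue are skipped, and counts are divided
    by the number of valid pairs.
    """
    seq = sequence.upper()
    counts = [0] * len(DIPEPTIDES)
    total = 0
    for first, second in zip(seq, seq[1:]):
        idx = INDEX.get(first + second)
        if idx is not None:
            counts[idx] += 1
            total += 1
    if total == 0:
        return [0.0] * len(DIPEPTIDES)
    return [c / total for c in counts]


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when the denominator is zero (a common convention).
    """
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom
```

A vector produced this way could be passed as one row of the feature matrix to any SVM implementation; the kernel and parameters used in the paper are not stated in this record.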
Keywords :
correlation methods; matrix algebra; medicine; support vector machines; Matthew correlation coefficient; SVM; dipeptide composition; medical microbiology; position specific scoring matrix; protein sequence; protein threading; right-handed beta-helix fold; support vector machine; virulence factors; Accuracy; Amino acids; Coils; Information technology; Libraries; Protein engineering; Protein sequence; Spine; Support vector machines; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Information Technology, 2007. CIT 2007. 7th IEEE International Conference on
Conference_Location :
Aizu-Wakamatsu, Fukushima
Print_ISBN :
978-0-7695-2983-7
Type :
conf
DOI :
10.1109/CIT.2007.75
Filename :
4385235