DocumentCode :
1992451
Title :
Toward The Recognition Code Of Protein-DNA Recognition
Author :
Shan, Juan ; Wang, Yuxuan ; Yan, Changhui
Author_Institution :
Utah State Univ., Logan
fYear :
2007
fDate :
14-17 Oct. 2007
Firstpage :
1290
Lastpage :
1293
Abstract :
Discovering the "recognition code" governing protein-DNA interaction has been an important topic for decades in bioinformatics. While other studies have focused on analyzing the frequency of amino acid-base contacts, this study here attempts to discover the structural and physicochemical features of proteins that determine the specificity of amino acid-base contacts. For each amino acid that contacts with DNA, we attempt to predict the type of bases (purines or pyrimidines) that it contacts. We extract 8 structural and physicochemical features from proteins and use a bottom-up approach to search for the combination of features that can be used to predict the specificity of amino acid-base contacts. In the end, 4 features are selected. Using these features, a support vector machine method can achieve 67.1% accuracy with 0.329 MCC in predicting the type of base (purines or pyrimidines) that an amino acid contacts. Analyzing the selected features will provide insights into the "recognition code" of protein-DNA interaction.
Keywords :
DNA; biology computing; learning (artificial intelligence); molecular biophysics; molecular configurations; proteins; support vector machines; amino acid-base contacts; bioinformatics; machine-learning; protein-DNA interaction; protein-DNA recognition; purines; pyrimidines; recognition code; support vector machine method; Amino acids; Bioinformatics; Computer science; DNA; Feature extraction; Frequency; Proteins; Sequences; Support vector machines; Target recognition; Protein-DNA; machine-learning; recognition code;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-1509-0
Type :
conf
DOI :
10.1109/BIBE.2007.4375733
Filename :
4375733
Link To Document :
بازگشت