DocumentCode
2970965
Title
A Feature Selection Algorithm for Detecting Subtype Specific Functional Sites from Protein Sequences for Smad Receptor Binding
Author
Marchiori, Elena ; Pirovano, Walter ; Heringa, Jaap ; Feenstra, K. Anton
Author_Institution
Centre for Integrative Bioinformatics, Vrije Univ., Amsterdam
fYear
2006
fDate
Dec. 2006
Firstpage
168
Lastpage
173
Abstract
Multiple sequence alignments are often used to reveal functionally important residues within a protein family. In particular, they can be very useful for identification of key residues that determine functional differences between protein subclasses (subtype specific sites). This paper proposes a new algorithm for selecting subtype specific sites from a set of aligned protein sequences. The algorithm combines a feature selection technique with neighbor position information for selecting and ranking a set of putative relevant sites. The algorithm is applied to a dataset of protein sequences from the MH2 domain of the SMAD family of transcriptor factors. Validation of the results on the basis of the known interaction and function of the sites shows that the algorithm successfully identifies the known (from literature) subtype specific sites and new putative ones
Keywords
biology computing; feature extraction; proteins; set theory; MH2 domain; feature selection algorithm; multiple sequence alignment; protein sequence; putative relevant site; smad receptor binding; Adhesives; Amino acids; Bioinformatics; Boosting; Cellular networks; Entropy; Genomics; Proteins; Signal processing; State estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications, 2006. ICMLA '06. 5th International Conference on
Conference_Location
Orlando, FL
Print_ISBN
0-7695-2735-3
Type
conf
DOI
10.1109/ICMLA.2006.7
Filename
4041487
Link To Document