DocumentCode
3408686
Title
Weighting features to recognize 3D patterns of electron density in X-ray protein crystallography
Author
Gopal, Kreshna ; Romo, Tod D. ; Sacchettini, James C. ; Ioerger, Thomas R.
Author_Institution
Dept. of Comput. Sci., Texas A&M Univ., TX, USA
fYear
2004
fDate
16-19 Aug. 2004
Firstpage
255
Lastpage
265
Abstract
Feature selection and weighting are central problems in pattern recognition and instance-based learning. In this work, we discuss the challenges of constructing and weighting features to recognize 3D patterns of electron density to determine protein structures. We present SLIDER, a feature-weighting algorithm that adjusts weights iteratively such that patterns that match query instances are better ranked than mismatching ones. Moreover, SLIDER makes judicious choices of weight values to be considered in each iteration, by examining specific weights at which matching and mismatching patterns switch as nearest neighbors to query instances. This approach reduces the space of weight vectors to be searched. We make the following two main observations: (1) SLIDER efficiently generates weights that contribute significantly in the retrieval of matching electron density patterns; (2) the optimum weight vector is sensitive to the distance metric i.e. feature relevance can be, to a certain extent, sensitive to the underlying metric used to compare patterns.
Keywords
X-ray crystallography; biology computing; crystal structure; iterative methods; learning (artificial intelligence); molecular biophysics; molecular configurations; pattern matching; proteins; 3D pattern recognition; SLIDER; X-ray protein crystallography; electron density; feature selection; feature weighting; instance-based learning; iterative method; matching patterns; mismatching patterns; optimum weight vector; protein structures; query instances; Crystallography; Electrons; Filters; Iterative algorithms; Machine learning; Nearest neighbor searches; Pattern matching; Pattern recognition; Proteins; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN
0-7695-2194-0
Type
conf
DOI
10.1109/CSB.2004.1332439
Filename
1332439
Link To Document