• DocumentCode
    3408686
  • Title

    Weighting features to recognize 3D patterns of electron density in X-ray protein crystallography

  • Author

    Gopal, Kreshna ; Romo, Tod D. ; Sacchettini, James C. ; Ioerger, Thomas R.

  • Author_Institution
    Dept. of Comput. Sci., Texas A&M Univ., TX, USA
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    255
  • Lastpage
    265
  • Abstract
    Feature selection and weighting are central problems in pattern recognition and instance-based learning. In this work, we discuss the challenges of constructing and weighting features to recognize 3D patterns of electron density to determine protein structures. We present SLIDER, a feature-weighting algorithm that adjusts weights iteratively such that patterns that match query instances are better ranked than mismatching ones. Moreover, SLIDER makes judicious choices of weight values to be considered in each iteration, by examining specific weights at which matching and mismatching patterns switch as nearest neighbors to query instances. This approach reduces the space of weight vectors to be searched. We make the following two main observations: (1) SLIDER efficiently generates weights that contribute significantly in the retrieval of matching electron density patterns; (2) the optimum weight vector is sensitive to the distance metric i.e. feature relevance can be, to a certain extent, sensitive to the underlying metric used to compare patterns.
  • Keywords
    X-ray crystallography; biology computing; crystal structure; iterative methods; learning (artificial intelligence); molecular biophysics; molecular configurations; pattern matching; proteins; 3D pattern recognition; SLIDER; X-ray protein crystallography; electron density; feature selection; feature weighting; instance-based learning; iterative method; matching patterns; mismatching patterns; optimum weight vector; protein structures; query instances; Crystallography; Electrons; Filters; Iterative algorithms; Machine learning; Nearest neighbor searches; Pattern matching; Pattern recognition; Proteins; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332439
  • Filename
    1332439