• DocumentCode
    2373641
  • Title

    Efficient retrieval of electron density patterns for modeling proteins by X-ray crystallography

  • Author

    Gopal, K. ; Romo, T.D. ; Sacchettini, J.C. ; Ioerger, T.R.

  • fYear
    2004
  • fDate
    16-18 Dec. 2004
  • Firstpage
    380
  • Lastpage
    387
  • Abstract
    Inefficient case retrieval is a major problem in many case-based reasoning systems, especially when case matching is expensive and the case-base is large. In this paper, we present a two-phase approach where an inexpensive feature-based method is used to find a set of potential matches and a more expensive and accurate case matching method is used to make the final selection. This approach has been successfully employed in TEXTAL™, a system that retrieves previously solved 3D patterns of electron density from a database to determine the structure of proteins. Electron density patterns are characterized by numeric features and an appropriate distance measure is used to efficiently filter good matches through an exhaustive search of the database. These matches are then examined using a computationally expensive density correlation procedure based on finding an optimal superposition between 3D patterns. We provide an empirical and theoretical analysis of some of the key issues related to this method. In particular, we define a model for estimating how approximate various feature-based similarity measures are (relative to an objective matching metric), and determine its relation to the number of cases that should be filtered from a given database to make the approach effective.
  • Keywords
    Computer science; Crystallography; Databases; Electrons; Information retrieval; Matched filters; Particle measurements; Pattern matching; Predictive models; Proteins;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Learning and Applications, 2004. Proceedings. 2004 International Conference on
  • Conference_Location
    Louisville, Kentucky, USA
  • Print_ISBN
    0-7803-8823-2
  • Type

    conf

  • DOI
    10.1109/ICMLA.2004.1383539
  • Filename
    1383539