• DocumentCode
    1586568
  • Title

    Applying Bayesian classification to protein structure

  • Author

    Hunter, Lawrence ; States, David J.

  • Author_Institution
    Nat. Libr. of Med., Bethesda, MD, USA
  • fYear
    1991
  • Firstpage
    10
  • Lastpage
    16
  • Abstract
    A report is given on the advantages of Bayesian classification over traditional methods and the challenges in applying the Autoclass III program, a heuristic Bayesian classifier, in the domain of biotechnology and protein structure classification. The machine learning technique of heuristic Bayesian classification specifically addresses the question of how many classes a dataset should be divided into, as well as what the classifications should be. The method is based on a minimal message length description of the dataset. The cost (in bits) of specifying a classification is added to the cost of accounting for each exemplar in terms of its distance from the class definition and the total cost is minimized. In addition to providing a well founded estimate of the number of classes necessary to optimally characterize a dataset, this method also generates test classifications where within-class variances differ significantly
  • Keywords
    Bayes methods; classification; data structures; learning systems; medical computing; proteins; Autoclass III program; Bayesian classification; biotechnology; class definition; dataset; heuristic Bayesian classifier; machine learning technique; minimal message length description; protein structure classification; test classifications; within-class variances; Amino acids; Bayesian methods; Biotechnology; Crystallography; Databases; Libraries; Machine learning algorithms; Proteins; Sequences; Shape;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence Applications, 1991. Proceedings., Seventh IEEE Conference on
  • Conference_Location
    Miami Beach, FL
  • Print_ISBN
    0-8186-2135-4
  • Type

    conf

  • DOI
    10.1109/CAIA.1991.120838
  • Filename
    120838