• DocumentCode
    1597179
  • Title

    A platform for applying multiple machine learning strategies to the task of understanding gene structure

  • Author

    Overton, G. Christian ; Pastor, Jon A.

  • Author_Institution
    Unisys CAIT, Paoli, PA, USA
  • fYear
    1991
  • Firstpage
    450
  • Lastpage
    457
  • Abstract
    Describes a system to support multiple machine learning strategies applied to the analysis of gene structure in general and the fine-structure of gene regulatory regions in particular. The focus is on two related tasks. The first is to take an overly generalized description of gene structure, represented as a formal grammar, and generate a detailed fine-structure description of a gene family (in this case the, β-hemoglobin family); this amounts to a restricted form of grammar induction. The second task is to identify patterns in the sequence of uncharacterized DNA that specify functional regions involved in gene regulation. Because too few properly classified training examples are usually available to support statistical induction of pattern descriptors, the authors have instead used a variant of case-based reasoning to identify patterns in uncharacterized DNA by comparison to analogous cases in well-characterized DNA
  • Keywords
    DNA; biology computing; computerised pattern recognition; grammars; inference mechanisms; learning systems; β-hemoglobin family; case-based reasoning; fine-structure description; formal grammar; functional regions; gene regulatory regions; gene structure; generalized description; grammar induction; multiple machine learning strategies; pattern descriptors; statistical induction; uncharacterized DNA; Artificial intelligence; DNA; Databases; Genetics; Induction generators; Information analysis; Logic; Machine learning; Proteins; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence Applications, 1991. Proceedings., Seventh IEEE Conference on
  • Conference_Location
    Miami Beach, FL
  • Print_ISBN
    0-8186-2135-4
  • Type

    conf

  • DOI
    10.1109/CAIA.1991.120907
  • Filename
    120907