• DocumentCode
    2076095
  • Title

    Using background knowledge to improve inductive learning of DNA sequences

  • Author

    Hirsh, Haym ; Noordewier, Michiel

  • Author_Institution
    Dept. of Comput. Sci., Rutgers Univ., New Brunswick, NJ, USA
  • fYear
    1994
  • fDate
    1-4 Mar 1994
  • Firstpage
    351
  • Lastpage
    357
  • Abstract
    Successful inductive learning requires that training data be expressed in a form where underlying regularities can be recognized by the learning system. Unfortunately, many applications of inductive learning-especially in the domain of molecular biology-have assumed that data are provided in a form already suitable for learning, whether or not such an assumption is actually justified. This paper describes the use of background knowledge of molecular biology to re-express data into a form more appropriate for learning. Our results show dramatic improvements in classification accuracy for two very different classes of DNA sequences using traditional “off-the-sheIf” decision-tree and neural-network inductive-learning methods
  • Keywords
    DNA; biology computing; inference mechanisms; learning (artificial intelligence); neural nets; pattern recognition; trees (mathematics); DNA sequences; background knowledge; classification accuracy; decision tree; inductive learning; molecular biology; neural network; training data; underlying regularities; Biological information theory; DNA; Data mining; Encoding; Polymers; Sampling methods; Sequences; Speech; Training data; US Department of Energy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence for Applications, 1994., Proceedings of the Tenth Conference on
  • Conference_Location
    San Antonia, TX
  • Print_ISBN
    0-8186-5550-X
  • Type

    conf

  • DOI
    10.1109/CAIA.1994.323654
  • Filename
    323654