• DocumentCode
    1149809
  • Title

    An information theoretic approach to rule induction from databases

  • Author

    Smyth, Padhraic ; Goodman, Rodney M.

  • Author_Institution
    California Inst. of Technol., Pasadena, CA, USA
  • Volume
    4
  • Issue
    4
  • fYear
    1992
  • fDate
    8/1/1992 12:00:00 AM
  • Firstpage
    301
  • Lastpage
    316
  • Abstract
    An algorithm for the induction of rules from examples is introduced. The algorithm is novel in the sense that it not only learns rules for a given concept (classification), but it simultaneously learns rules relating multiple concepts. This type of learning, known as generalized rule induction, is considerably more general than existing algorithms, which tend to be classification oriented. Initially, it is focused on the problem of determining a quantitative, well-defined rule preference measure. In particular, a quantity called the J-measure is proposed as an information-theoretic alternative to existing approaches. The J-measure quantifies the information content of a rule or a hypothesis. The information theoretic origins of this measure are outlined, and its plausibility as a hypothesis preference measure is examined. The ITRULE algorithm, which uses the measure to learn a set of optimal rules from a set of data samples, is defined. Experimental results on real-world data are analyzed
  • Keywords
    database management systems; expert systems; information theory; knowledge acquisition; learning systems; ITRULE algorithm; J-measure; generalized rule induction; hypothesis preference measure; information theoretic approach; learning; multiple concepts; rule induction from databases; rule preference measure; Algorithm design and analysis; Diagnostic expert systems; Expert systems; Humans; Knowledge acquisition; Laboratories; Machine learning algorithms; Propulsion; Space technology; Spatial databases;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.149926
  • Filename
    149926