• DocumentCode
    1961913
  • Title

    Sampling issues in generating rules from databases

  • Author

    Lee, Changhwan

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Connecticut, Storrs, CT, USA
  • fYear
    1993
  • fDate
    8-11 Nov 1993
  • Firstpage
    435
  • Lastpage
    439
  • Abstract
    There have been a number of studies concerning inductive rule generation from databases. All inductive rules are based on instances of the databases, and such instances can be regarded as the sample of the population in the real world. Therefore, the validity, unbiasedness and correctness of these instances cannot be overemphasized in the rule induction environment. While many researchers have focused on the methods of generating rules from databases, the author discusses some sampling issues that occur in rule generation from databases. The author tries to bridge the gap between sampling in statistics and rule generation in databases. Two sampling problems-small sample size and biased sample-which occur mostly in rule induction were studied. The author investigates how these problems affect the validity of rule induction and provides a set of criteria for a rule induction system to generate feasible rules
  • Keywords
    database theory; deductive databases; inference mechanisms; biased sample; correctness; database instances; feasible rules; rule generation; rule induction system; small sample size; statistics; unbiasedness; validity; Artificial intelligence; Bridges; Computer science; Data engineering; Databases; Diseases; Induction generators; Medical diagnostic imaging; Sampling methods; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence, 1993. TAI '93. Proceedings., Fifth International Conference on
  • Conference_Location
    Boston, MA
  • ISSN
    1063-6730
  • Print_ISBN
    0-8186-4200-9
  • Type

    conf

  • DOI
    10.1109/TAI.1993.633992
  • Filename
    633992