DocumentCode
1961913
Title
Sampling issues in generating rules from databases
Author
Lee, Changhwan
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Connecticut, Storrs, CT, USA
fYear
1993
fDate
8-11 Nov 1993
Firstpage
435
Lastpage
439
Abstract
There have been a number of studies concerning inductive rule generation from databases. All inductive rules are based on instances of the databases, and such instances can be regarded as the sample of the population in the real world. Therefore, the validity, unbiasedness and correctness of these instances cannot be overemphasized in the rule induction environment. While many researchers have focused on the methods of generating rules from databases, the author discusses some sampling issues that occur in rule generation from databases. The author tries to bridge the gap between sampling in statistics and rule generation in databases. Two sampling problems-small sample size and biased sample-which occur mostly in rule induction were studied. The author investigates how these problems affect the validity of rule induction and provides a set of criteria for a rule induction system to generate feasible rules
Keywords
database theory; deductive databases; inference mechanisms; biased sample; correctness; database instances; feasible rules; rule generation; rule induction system; small sample size; statistics; unbiasedness; validity; Artificial intelligence; Bridges; Computer science; Data engineering; Databases; Diseases; Induction generators; Medical diagnostic imaging; Sampling methods; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 1993. TAI '93. Proceedings., Fifth International Conference on
Conference_Location
Boston, MA
ISSN
1063-6730
Print_ISBN
0-8186-4200-9
Type
conf
DOI
10.1109/TAI.1993.633992
Filename
633992
Link To Document