Title :
Constraint-based rule mining in large, dense databases
Author :
Bayardo, Roberto J., Jr. ; Agrawal, Rakesh ; Gunopulos, Dimitrios
Author_Institution :
IBM Almaden Res. Center, San Jose, CA, USA
Abstract :
Constraint-based rule miners find all rules in a given dataset meeting user-specified constraints such as minimum support and confidence. We describe a new algorithm that directly exploits all user-specified constraints including minimum support, minimum confidence, and a new constraint that ensures every mined rule offers a predictive advantage over any of its simplifications. Our algorithm maintains efficiency even at low supports on data that is dense (e.g. relational data). Previous approaches such as Apriori and its variants exploit only the minimum support constraint, and as a result are ineffective on dense data date to a combinatorial explosion of “frequent itemsets”
Keywords :
constraint handling; data mining; very large databases; algorithm; constraint-based rule mining; dataset; large dense databases; minimum confidence; minimum support; predictive advantage; relational data; user-specified constraints; Association rules; Dairy products; Data mining; Databases; Decision trees; Electrical capacitance tomography; Explosions; Failure analysis; Itemsets; Proposals;
Conference_Titel :
Data Engineering, 1999. Proceedings., 15th International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
0-7695-0071-4
DOI :
10.1109/ICDE.1999.754924