Title :
Set-oriented mining for association rules in relational databases
Author :
Houtsma, Maurice ; Swami, Arun
Author_Institution :
Twente Univ., Enschede, Netherlands
Abstract :
Describe set-oriented algorithms for mining association rules. Such algorithms imply performing multiple joins and may appear to be inherently less efficient than special-purpose algorithms. We develop new algorithms that can be expressed as SQL queries, and discuss the optimization of these algorithms. After analytical evaluation, an algorithm named SETM emerges as the algorithm of choice. SETM uses only simple database primitives, viz. sorting and merge-scan join. SETM is simple, fast and stable over the range of parameter values. The major contribution of this paper is that it shows that at least some aspects of data mining can be carried out by using general query languages such as SQL, rather than by developing specialized black-box algorithms. The set-oriented nature of SETM facilitates the development of extensions
Keywords :
SQL; database theory; deductive databases; knowledge acquisition; merging; optimisation; query processing; relational databases; set theory; sorting; SETM; SQL queries; algorithm optimization; association rules; data mining; database primitives; efficiency; extensions; general query languages; merge-scan join; multiple joins; relational databases; set-oriented mining; sorting; Algorithm design and analysis; Artificial intelligence; Association rules; Companies; Data mining; Database languages; Decision making; Marketing and sales; Relational databases; Sorting;
Conference_Titel :
Data Engineering, 1995. Proceedings of the Eleventh International Conference on
Conference_Location :
Taipei
Print_ISBN :
0-8186-6910-1
DOI :
10.1109/ICDE.1995.380413