DocumentCode :
1065961
Title :
Specifying mining algorithms with iterative user-defined aggregates
Author :
Giannotti, Fosca ; Manco, Giuseppe ; Turini, Franco
Author_Institution :
CNR, Pisa, Italy
Volume :
16
Issue :
10
fYear :
2004
Firstpage :
1232
Lastpage :
1246
Abstract :
We present a way of exploiting domain knowledge in the design and implementation of data mining algorithms, with special attention to frequent patterns discovery, within a deductive framework. In our framework, domain knowledge is represented by way of deductive rules, and data mining algorithms are specified by means of iterative user-defined aggregates and implemented by means of user-defined predicates. This choice allows us to exploit the full expressive power of deductive rules without loosing in performance. Iterative user-defined aggregates have a fixed scheme, in which user-defined predicates are to be added. This feature allows the modularization of data mining algorithms, thus providing a way to integrate the proper domain knowledge exploitation in the right point. As a case study, we present how user-defined aggregates can be exploited to specify and implement a version of the a priori algorithm. Some performance analyzes and comparisons are discussed in order to show the effectiveness of the approach.
Keywords :
data mining; deductive databases; knowledge based systems; logic programming languages; pattern recognition; query languages; query processing; tree data structures; association rules; constraint language; data mining algorithm modularization; deductive rules; domain knowledge exploitation; iterative user-defined aggregates; logic language; pattern discovery; query language; rule-based databases; user-defined predicates; Aggregates; Algorithm design and analysis; Association rules; Data mining; Database languages; Deductive databases; Helium; Iterative algorithms; Logic; Performance analysis; 65; Index Terms- Data mining; association rules.; constraint and logic languages; query languages; rule-based databases; user-defined aggregates;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2004.64
Filename :
1324631
Link To Document :
بازگشت