DocumentCode :
1565615
Title :
Efficient data-structures and parallel algorithms for association rules discovery
Author :
Cérin, Christophe ; Gay, Jean-Sébatien ; Le Mahec, Gaël ; Koskas, Michel
Author_Institution :
Univ. de Picardie Jules Verne, Amiens, France
fYear :
2004
Firstpage :
399
Lastpage :
406
Abstract :
Discovering patterns or frequent episodes in transactions is an important problem in data mining for the purpose of infering deductive rules from them. Because of the huge size of the data to deal with, parallel algorithms have been designed for reducing both the execution time and the number of repeated passes over the database in order to reduce, as much as possible, I/O overheads. In this paper, we introduce approaches for the implementation of two basic algorithms for association rules discovery (namely Apriori and Eclat). Our approaches combine efficient data structures to code different key information (line indexes, candidates) and we exhibit how to introduce parallelism for processing such data-structures.
Keywords :
data mining; data structures; database management systems; inference mechanisms; parallel algorithms; Apriori; Eclat; association rules discovery; bit vectors; data mining; data structures; deductive rule inference; parallel algorithms; radix trees; Algorithm design and analysis; Association rules; Data mining; Data structures; Inference algorithms; Itemsets; Parallel algorithms; Parallel processing; Transaction databases; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science, 2004. ENC 2004. Proceedings of the Fifth Mexican International Conference in
Print_ISBN :
0-7695-2160-6
Type :
conf
DOI :
10.1109/ENC.2004.1342634
Filename :
1342634
Link To Document :
بازگشت