DocumentCode :
2639209
Title :
Mining association rules: anti-skew algorithms
Author :
Lin, Jun-Lin ; Dunham, Margaret H.
Author_Institution :
Dept. of Comput. Sci. & Eng., Southern Methodist Univ., Dallas, TX, USA
fYear :
1998
fDate :
23-27 Feb 1998
Firstpage :
486
Lastpage :
493
Abstract :
Mining association rules among items in a large database has been recognized as one of the most important data mining problems. All proposed approaches for this problem require scanning the entire database at least or almost twice in the worst case. We propose several techniques which overcome the problem of data skew in the basket data. These techniques reduce the maximum number of scans to less than 2, and in most cases find all association rules in about 1 scan. Our algorithms employ prior knowledge collected during the mining process and/or via sampling, to further reduce the number of candidate itemsets and identify false candidate itemsets at an earlier stage
Keywords :
deductive databases; knowledge acquisition; very large databases; anti skew algorithms; association rule mining; association rules; basket data; candidate itemsets; data mining problems; data skew; large database; mining process; prior knowledge; sampling; Association rules; Computer science; Data engineering; Data mining; Data systems; Decision making; Itemsets; Partitioning algorithms; Sampling methods; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 1998. Proceedings., 14th International Conference on
Conference_Location :
Orlando, FL
ISSN :
1063-6382
Print_ISBN :
0-8186-8289-2
Type :
conf
DOI :
10.1109/ICDE.1998.655811
Filename :
655811
Link To Document :
بازگشت