DocumentCode :
477815
Title :
An Efficient Algorithm for Mining Large Item Sets
Author :
Zheng, Hong-Zhen ; Chu, Dian-Hui ; Zhan, De-chen ; Xu, Xiao-Fei
Author_Institution :
Coll. of Comput. Sci. & Technol., Harbin Inst. of Technol., Weihai
Volume :
2
fYear :
2008
fDate :
18-20 Oct. 2008
Firstpage :
561
Lastpage :
564
Abstract :
It propose online mining algorithm ( OMA) which online discover large item sets. Without pre-setting a default threshold, the OMA algorithm achieves its efficiency and threshold-flexibility by calculating item-setspsila counts. It is unnecessary and independent of the default threshold and can flexibly adapt to any userpsilas input threshold. In addition, we propose cluster-based association rule algorithm (CARA) creates cluster tables to aid discovery of large item sets. It only requires a single scan of the database, followed by contrasts with the partial cluster tables. It not only prunes considerable amounts of data reducing the time needed to perform data scans and requiring less contrast, but also ensures the correctness of the mined results. By using the CARA algorithm to create cluster tables in advance, each CPU can be utilized to process a cluster table; thus large item sets can be immediately mined even when the database is very large.
Keywords :
data mining; pattern clustering; cluster-based association rule algorithm; default threshold; efficient algorithm; large item set mining; online mining; partial cluster tables; threshold flexibility; Appropriate technology; Association rules; Clustering algorithms; Computer science; Data mining; Databases; Educational institutions; Fuzzy systems; Itemsets; Partitioning algorithms; Association rules; Data mining; Large item sets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Shandong
Print_ISBN :
978-0-7695-3305-6
Type :
conf
DOI :
10.1109/FSKD.2008.679
Filename :
4666179
Link To Document :
بازگشت