شماره ركورد :
20432
عنوان به زبان ديگر :
An Efficient Sampling Approach for Mining all Association Rules in Large Databases
پديد آورندگان :
Deypir M نويسنده , Sadreddini M H نويسنده
از صفحه :
73
تا صفحه :
78
تعداد صفحه :
6
چكيده لاتين :
Mining association rules is an interesting problem in the field of knowledge discovery and data mining. In this paper a fast, new method for mining association rules is presented. In this new sampling approach, which utilizes FP-Growth to mine sample data, the candidate generation and test in the sample data is omitted because of the FP-Tree projection of sample data in the main memory. To overcome the main memory limitation in the new sampling method, a useful technique is proposed. Theoretical time consideration and empirical evaluation show that the new sampling approach is superior to the traditional Apriori-based sampling by orders of magnitude. Experimental evaluations on artificial and real-life datasets show that our approach, compared with a previously proposed sampling algorithm, is more efficient when the minimal support threshold is decreased and is also more stable as the size of the sample data increases.
شماره مدرك :
1204469
لينک به اين مدرک :
بازگشت