DocumentCode
234700
Title
Proposed algorithm for frequent item set generation
Author
Singh, Ashutosh ; Agarwal, Jatin
Author_Institution
ASET, Amity Univ. Uttar Pradesh, Noida, India
fYear
2014
fDate
7-9 Aug. 2014
Firstpage
160
Lastpage
165
Abstract
Data mining is an efficient technology to discover patterns in large databases. Association rule mining techniques are used to find the correlation between the various item sets in the database, and this correlation between various item sets are used in decision making and pattern analysis. In recent years the problem of finding frequent items and association rules from large datasets has been proposed by many researchers. Various research papers on association rule mining (ARM) are studied and analyzed first to understand the existing algorithms. The Apriori algorithm is the basic ARM algorithm, but it requires so many database scans to find frequent items. In Dynamic Item set counting (DIC) algorithm less number of database scans are needed, but complex data structure lattice is used. The main focus of this paper is to propose a new optimized algorithm (FI-generator) and to compare its performance with the existing algorithms. A secondary data set is used to find out frequent item sets and association rules with the help of existing and proposed algorithm). We observed that the proposed algorithm find out the frequent item sets and association rules from databases as compared to the existing algorithms in less numbers of database scans. In the proposed algorithm an optimized data structure adjacency matrix is used. Proposed algorithm reduces the size of candidate-K item set in successive iteration. Pruning is also done at two stages which reduces the memory space.
Keywords
data mining; data structures; database management systems; decision making; ARM algorithm; Apriori algorithm; DIC algorithm; FI-generator algorithm; association rule mining techniques; candidate-K item set; complex data structure lattice; data mining; database scans; decision making; dynamic item set counting algorithm; frequent item set generation; optimized data structure adjacency matrix; pattern analysis; pattern discovery; Association rules; Heuristic algorithms; Partitioning algorithms; Solids; Transaction databases; Adjacency matrix; Association rules; Data mining; FI-generator; Frequent item sets;
fLanguage
English
Publisher
ieee
Conference_Titel
Contemporary Computing (IC3), 2014 Seventh International Conference on
Conference_Location
Noida
Print_ISBN
978-1-4799-5172-7
Type
conf
DOI
10.1109/IC3.2014.6897166
Filename
6897166
Link To Document