Title :
A Novel Multivariate Discretization Method for Mining Association Rules
Author_Institution :
Sch. of Software, Nanchang Univeristy, Nanchang, China
Abstract :
Data mining aims at discovering useful patterns in large datasets. In this paper, we present a novel multivariate discretization method for finding association patterns based on clustering and genetic algorithm. This method consists of two steps. Firstly we adopt a density-based clustering technique to identify the regions that possibly hide the interesting patterns from data space. Confined to the data in these regions, we then develop a genetic algorithm to simultaneously discretize multi-attributes according to entropy criterion. The effectiveness of the proposed method is demonstrated with the experiment on a real data set.
Keywords :
data mining; entropy; genetic algorithms; pattern clustering; association rule; data mining; density-based clustering technique; entropy criterion; genetic algorithm; multivariate discretization method; pattern clustering; Association rules; Clustering algorithms; Data mining; Electronic mail; Entropy; Frequency; Genetic algorithms; Information processing; Itemsets; Testing; association rule; clustering; data mining; discretization; genetic algorithm;
Conference_Titel :
Information Processing, 2009. APCIP 2009. Asia-Pacific Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-0-7695-3699-6
DOI :
10.1109/APCIP.2009.102