Title :
Mining and representing rare association rules through the use of genetic programming
Author :
Luna, José María ; Romero, José Raúl ; Ventura, Sebastián
Author_Institution :
Dept. of Comput. Sci. & Numerical Anal., Univ. of Cordoba, Cordoba, Spain
Abstract :
Whereas the extraction of frequent patterns has focused the major researches in association rule mining, the requirements of reliable rules that do not frequently appear is taking an increasing interest in a great number of areas. This field has not been explored in depth and most algorithms for mining infrequent association rules follow an exhaustive search methodology, which hampers the extracting process because of the size of the datasets. The importance of discovering patterns that do not frequently appear in a dataset and the promising results obtained when using evolutionary proposals in the field of frequent pattern mining motivates the evolutionary proposal for discovering rare association rules presented in this paper. Here, a context-free grammar is described and applied to adapt individuals to each particular problem or domain. The use of both an evolutionary approach and a context-free grammar reduces the memory requirements and provides the possibility of extracting any kind of rules, respectively. The experimental study shows that this proposal obtains a set of reliable infrequent rules in a short period of time.
Keywords :
context-free grammars; data mining; genetic algorithms; context-free grammar; evolutionary proposals; extracting process; frequent pattern mining; genetic programming; infrequent association rule mining; rare association rule mining; rare association rule representation; search methodology; Association rules; Genetics; Grammar; Production; Proposals; Reliability; Grammar-based Evolutionary Algorithms; Infrequent Rule Mining; Rare Association Rule;
Conference_Titel :
Nature and Biologically Inspired Computing (NaBIC), 2011 Third World Congress on
Conference_Location :
Salamanca
Print_ISBN :
978-1-4577-1122-0
DOI :
10.1109/NaBIC.2011.6089422