DocumentCode
2171179
Title
Mining frequent itemsets in data streams based on genetic algorithm
Author
Chong Han ; Lijuan Sun ; Jian Guo ; Xiaodong Chen
Author_Institution
Coll. of Comput., Nanjing Univ. of Posts & Telecommun., Nanjing, China
fYear
2013
fDate
17-19 Nov. 2013
Firstpage
748
Lastpage
753
Abstract
Stream data is a very common data type in big data and in many data streams applications, users tend to pay more attention to the mode information of the data streams. So mining frequent patterns in data streams is a significative work. Meanwhile, finding frequent itemests in a data set with predefined fixed support threshold could be seen as an optimization problem. In this paper, the problem of frequent itemsets mining is derived as a non-linear optimization problem, then genetic algorithm is adopted to solve it. Through the formal and bitmap representation of frequent itemsets, the non-linear optimization problem is transformed to 0-1 programming. A set of experimental results show that unlike typical Apriori algorithm, the complexity of time and memory space grows exponentially as the support decrease, our proposed algorithm has a high time and space efficiency even with a very low support.
Keywords
Big Data; computational complexity; data mining; genetic algorithms; nonlinear programming; 0-1 programming; Big Data; apriori algorithm; bitmap representation; data set; data streams applications; data type; frequent itemsets mining; frequent pattern mining; genetic algorithm; memory space; nonlinear optimization problem; time complexity; Biological cells; Data mining; Genetic algorithms; Itemsets; Sociology; Statistics; Big data; data streams; frequent itemsets; genetic algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Communication Technology (ICCT), 2013 15th IEEE International Conference on
Conference_Location
Guilin
Type
conf
DOI
10.1109/ICCT.2013.6820474
Filename
6820474
Link To Document