Title :
A More Accurate Space Saving Algorithm for Finding the Frequent Items
Author :
Zhou, Jun ; Chen, Ming ; Xiong, Huan
Author_Institution :
Dept. of Comput. Sci., PLAUST, Nanjing, China
Abstract :
The frequent items problem is to process a stream as a stream of items and find all items occurring more than a given fraction of the time. It is one of the most heavily studied problems in data stream mining, dating back to the 1980s. Aiming at higher false positive rate of the Space-Saving algorithm, an LRU-based (Least Recently Used, LRU) improved algorithm with low frequency item pre-eliminated is proposed. Accuracy, stability and adaptability of the improved algorithm have been apparently enhanced. Experimental results indicate that the algorithm can not only be used to find the frequent items, and can be used to estimate the frequency of them precisely. The improved algorithm can be used for online processing both high-speed network packet stream and backbone NetFlow stream.
Keywords :
computer networks; data mining; LRU-based improved algorithm; NetFlow stream; data stream mining; frequent items; high-speed network packet stream; space-saving algorithm; stability; Accuracy; Algorithm design and analysis; Classification algorithms; Complexity theory; Heuristic algorithms; Monitoring; Radiation detectors;
Conference_Titel :
Database Technology and Applications (DBTA), 2010 2nd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-6975-8
Electronic_ISBN :
978-1-4244-6977-2
DOI :
10.1109/DBTA.2010.5659027