DocumentCode :
3453171
Title :
A More Accurate Space Saving Algorithm for Finding the Frequent Items
Author :
Zhou, Jun ; Chen, Ming ; Xiong, Huan
Author_Institution :
Dept. of Comput. Sci., PLAUST, Nanjing, China
fYear :
2010
fDate :
27-28 Nov. 2010
Firstpage :
1
Lastpage :
5
Abstract :
The frequent items problem is to process a stream as a stream of items and find all items occurring more than a given fraction of the time. It is one of the most heavily studied problems in data stream mining, dating back to the 1980s. Aiming at higher false positive rate of the Space-Saving algorithm, an LRU-based (Least Recently Used, LRU) improved algorithm with low frequency item pre-eliminated is proposed. Accuracy, stability and adaptability of the improved algorithm have been apparently enhanced. Experimental results indicate that the algorithm can not only be used to find the frequent items, and can be used to estimate the frequency of them precisely. The improved algorithm can be used for online processing both high-speed network packet stream and backbone NetFlow stream.
Keywords :
computer networks; data mining; LRU-based improved algorithm; NetFlow stream; data stream mining; frequent items; high-speed network packet stream; space-saving algorithm; stability; Accuracy; Algorithm design and analysis; Classification algorithms; Complexity theory; Heuristic algorithms; Monitoring; Radiation detectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Technology and Applications (DBTA), 2010 2nd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-6975-8
Electronic_ISBN :
978-1-4244-6977-2
Type :
conf
DOI :
10.1109/DBTA.2010.5659027
Filename :
5659027
Link To Document :
بازگشت