Title :
Adaptive Frequency Counting over Bursty Data Streams
Author :
Lin, Bill ; Ho, Wai-Shing ; Kao, Ben ; Chui, Chun-Kit
Author_Institution :
Dept. of Comput. Sci., Hong Kong Univ.
Date :
March 1 2007-April 5 2007
Abstract :
We investigate the problem of frequent itemset mining over a data stream with bursty traffic. In many modern applications, data arrives at a system as a continuous stream of transactions, and the arrival rate of those transactions often fluctuates wildly. Traditional stream mining algorithms, such as Lossy Counting (LC), were generally designed to handle data streams with steady arrival rates. We show that LC suffers a significant loss of accuracy when the data stream is bursty. We propose the Adaptive Frequency Counting (AFC) algorithm to handle bursty data. AFC has a feedback mechanism that dynamically adjusts the mining speed to cope with the changing data arrival rate. Through extensive experiments, we show that AFC outperforms LC under bursty traffic in terms of the accuracy of the set of frequent itemsets.
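For readers unfamiliar with the LC baseline named in the abstract, the following is a minimal sketch of the textbook Lossy Counting scheme, simplified to single items rather than itemsets. It is not the authors' code and does not attempt to reproduce AFC, whose feedback mechanism is not specified in this record. The class name `LossyCounter` and its parameters `epsilon` and `support` are assumptions chosen for illustration.

```python
import math
from typing import Dict, Hashable, List


class LossyCounter:
    """Illustrative single-item Lossy Counting (hypothetical helper, not from the paper)."""

    def __init__(self, epsilon: float) -> None:
        if not 0 < epsilon < 1:
            raise ValueError("epsilon must be in (0, 1)")
        self.epsilon = epsilon
        self.width = math.ceil(1 / epsilon)  # number of stream elements per bucket
        self.n = 0                           # stream elements seen so far
        # item -> [observed count, maximum possible undercount (delta)]
        self.entries: Dict[Hashable, List[int]] = {}

    def add(self, item: Hashable) -> None:
        self.n += 1
        bucket = math.ceil(self.n / self.width)  # id of the current bucket
        entry = self.entries.get(item)
        if entry is not None:
            entry[0] += 1
        else:
            # A new item may have been missed in up to (bucket - 1) earlier buckets.
            self.entries[item] = [1, bucket - 1]
        if self.n % self.width == 0:
            # Bucket boundary: drop entries whose count plus delta cannot exceed the bucket id.
            self.entries = {
                key: value
                for key, value in self.entries.items()
                if value[0] + value[1] > bucket
            }

    def frequent(self, support: float) -> Dict[Hashable, int]:
        """Return items whose observed count is at least (support - epsilon) * n."""
        threshold = (support - self.epsilon) * self.n
        return {k: v[0] for k, v in self.entries.items() if v[0] >= threshold}
```

In this sketch pruning happens only at bucket boundaries, so each element is handled with a fixed amount of work regardless of how fast the stream arrives. The abstract's claim is that LC was designed with steady arrival rates in mind and loses accuracy when traffic is bursty.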
Keywords :
data mining; adaptive frequency counting algorithm; bursty data streams; bursty traffic; data stream handling; frequent itemset mining; Automatic frequency control; Computational intelligence; Computer science; Data analysis; Data mining; Electronic mail; Itemsets; Monitoring; Performance analysis; Telecommunication traffic
Conference_Title :
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0705-2
DOI :
10.1109/CIDM.2007.368918