Title :
Mining indirect associations over data streams
Author :
Chen, Chun-Hao ; Lin, Wen-Yang ; Chen, Yi-Ching ; Li, He-Yi
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Tamkang Univ., Taipei, Taiwan
Abstract :
Indirect association is a new type of infrequent pattern, which provides a new way for interpreting the value of infrequent patterns and can effectively reduce the number of uninteresting infrequent patterns. The concept of indirect association is to “indirectly”connect two rarely co-occurred items via a frequent itemset called mediator, and if appropriately utilized it can help to identify real interesting “infrequent itempairs” from databases. All of the literature on indirect association mining, to our best knowledge, is confined to the traditional, relatively static database environment; no research work has been conducted on mining indirect associations over data streams. In this paper, we propose an approach, namely MIA-LM (Mining Indirect Association over a Landmark Model) algorithm, for mining indirect associations over data streams. The proposed method can not only discover indirect associations over data streams efficiently, but also guarantee the error of derived itemsets not to exceed a user-specified parameter. Experiments on real web-click stream are also made to show the effectiveness and efficiency of the proposed approach.
Keywords :
algorithm theory; data mining; data stream; indirect association mining; landmark model algorithm; mediator; static database; user-specified parameter; data stream mining; indirect association; indirect itempair; landmark window model;
Conference_Titel :
System Science and Engineering (ICSSE), 2010 International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-6472-2
DOI :
10.1109/ICSSE.2010.5551716