DocumentCode :
1158074
Title :
Efficient Tree Structures for High Utility Pattern Mining in Incremental Databases
Author :
Ahmed, Chowdhury Farhan ; Tanbeer, Syed Khairuzzaman ; Jeong, Byeong-Soo ; Lee, Young-Koo
Author_Institution :
Dept. of Comput. Eng., Kyung Hee Univ., Yongin, South Korea
Volume :
21
Issue :
12
fYear :
2009
Firstpage :
1708
Lastpage :
1721
Abstract :
Recently, high utility pattern (HUP) mining is one of the most important research issues in data mining due to its ability to consider the nonbinary frequency values of items in transactions and different profit values for every item. On the other hand, incremental and interactive data mining provide the ability to use previous data structures and mining results in order to reduce unnecessary calculations when a database is updated, or when the minimum threshold is changed. In this paper, we propose three novel tree structures to efficiently perform incremental and interactive HUP mining. The first tree structure, Incremental HUP Lexicographic Tree (IHUPL-Tree), is arranged according to an item´s lexicographic order. It can capture the incremental data without any restructuring operation. The second tree structure is the IHUP transaction frequency tree (IHUPTF-Tree), which obtains a compact size by arranging items according to their transaction frequency (descending order). To reduce the mining time, the third tree, IHUP-transaction-weighted utilization tree (IHUPTWU-Tree) is designed based on the TWU value of items in descending order. Extensive performance analyses show that our tree structures are very efficient and scalable for incremental and interactive HUP mining.
Keywords :
data mining; tree data structures; data mining; data structures; high utility pattern mining; incremental HUP lexicographic tree; incremental databases; tree structures; Data mining; frequent pattern mining; high utility pattern mining; incremental mining; interactive mining.;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2009.46
Filename :
4782959
Link To Document :
بازگشت