Title :
Compact transaction database for efficient frequent pattern mining
Author :
Wan, Qian ; An, Aijun
Author_Institution :
Dept. of Comput. Sci. & Eng., York Univ., Toronto, Ont., Canada
Abstract :
Mining frequent patterns is one of the fundamental and essential operations in many data mining applications, such as discovering association rules. In this paper, we propose an innovative approach to generating compact transaction databases for efficient frequent pattern mining. It uses a compact tree structure, called CT-tree, to compress the original transactional data. This allows the CT-a priori algorithm, which is revised from the classical a priori algorithm, to generate frequent patterns quickly by skipping the initial database scan and reducing a great amount of I/O time per database scan. Empirical evaluations show that our approach is effective, efficient and promising, while the storage space requirement as well as the mining time can be decreased dramatically on both synthetic and real-world databases.
Keywords :
data mining; transaction processing; tree data structures; CT-a priori algorithm; CT-tree; compact transaction database; compact tree structure; data mining; frequent pattern mining; storage space requirement; transactional data; Algorithm design and analysis; Application software; Association rules; Computer science; Data engineering; Data mining; Itemsets; Transaction databases; Tree data structures; Web pages;
Conference_Titel :
Granular Computing, 2005 IEEE International Conference on
Print_ISBN :
0-7803-9017-2
DOI :
10.1109/GRC.2005.1547372