Title :
An efficient approach to discovering knowledge from large databases
Author :
Yen, Show-Jane ; Chen, Arbee L P
Author_Institution :
Dept. of Comput. Sci., Nat. Tsing Hua Univ., Hsinchu, Taiwan
Abstract :
We study two problems: mining association rules and mining sequential patterns in a large database of customer transactions. The problem of mining association rules focuses on discovering large itemsets where a large itemset is a group of items which appear together in a sufficient number of transactions; while the problem of mining sequential patterns focuses on discovering large sequences where a large sequence is an ordered list of sets of items which appear in a sufficient number of transactions. We present efficient graph based algorithms to solve these problems. The algorithms construct an association graph to indicate the associations between items and then traverse the graph to generate large itemsets and large sequences, respectively. Our algorithms need to scan the database only once. Empirical evaluations show that our algorithms outperform other algorithms which need to make multiple passes over the database
Keywords :
deductive databases; graph theory; knowledge acquisition; transaction processing; very large databases; association graph; association rule mining; customer transactions; graph based algorithms; knowledge discovery; large databases; large itemsets; multiple passes; sequential pattern mining; sequential patterns; Association rules; Computer science; Contracts; Councils; Dairy products; Data mining; Itemsets; Quality management; Query processing; Transaction databases;
Conference_Titel :
Parallel and Distributed Information Systems, 1996., Fourth International Conference on
Conference_Location :
Miami Beach, FL
Print_ISBN :
0-8186-7475X
DOI :
10.1109/PDIS.1996.568663