Title :
An algorithm for mining strongly correlated pairs in relational table
Author :
Zhang, Jian-pei ; Li, Qiang
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Eng. Univ., China
Abstract :
Given a user-specified minimum correlation threshold and a relational table, the problem of mining all-strong correlated pairs is to find all attribute value pairs with Pearson´s correlation coefficients above the minimum correlation threshold. However, algorithms developed for transaction database will generate invalid candidate pairs due to fundamental property of the itemsets in relational table (i.e. 1NF, they cannot contain more that one item per table column) and hence encounter additional and unnecessary computation cost. In this paper, using this property, the join step in the candidate generation phase is adapted to reflect this and to prune candidate set by not taking into itemsets which are not in 1NF. Furthermore, we propose other techniques to reduce the number of candidate pairs that are to be examined in the refinement step, even when the upper bound based pruning technique is useless in case of very low correlation threshold. Experimental results from real data sets exhibit that our algorithm can produce smaller candidate set and be faster than previous algorithms.
Keywords :
correlation methods; data mining; relational databases; Pearson correlation coefficients1; all attribute value pairs; all-strong correlated pairs; association rules; data mining; minimum correlation threshold; relational table; strongly correlated pairs; upper bound based pruning; Association rules; Computational efficiency; Computer science; Data mining; Electronic mail; Itemsets; Relational databases; Statistics; Transaction databases; Upper bound; Association Rule; Correlation; Data Mining; Relational Table; Transactions;
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1527206