DocumentCode :
441778
Title :
An algorithm for mining strongly correlated pairs in relational table
Author :
Zhang, Jian-pei ; Li, Qiang
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Eng. Univ., China
Volume :
3
fYear :
2005
fDate :
18-21 Aug. 2005
Firstpage :
1631
Abstract :
Given a user-specified minimum correlation threshold and a relational table, the problem of mining all-strong correlated pairs is to find all attribute value pairs with Pearson´s correlation coefficients above the minimum correlation threshold. However, algorithms developed for transaction database will generate invalid candidate pairs due to fundamental property of the itemsets in relational table (i.e. 1NF, they cannot contain more that one item per table column) and hence encounter additional and unnecessary computation cost. In this paper, using this property, the join step in the candidate generation phase is adapted to reflect this and to prune candidate set by not taking into itemsets which are not in 1NF. Furthermore, we propose other techniques to reduce the number of candidate pairs that are to be examined in the refinement step, even when the upper bound based pruning technique is useless in case of very low correlation threshold. Experimental results from real data sets exhibit that our algorithm can produce smaller candidate set and be faster than previous algorithms.
Keywords :
correlation methods; data mining; relational databases; Pearson correlation coefficients1; all attribute value pairs; all-strong correlated pairs; association rules; data mining; minimum correlation threshold; relational table; strongly correlated pairs; upper bound based pruning; Association rules; Computational efficiency; Computer science; Data mining; Electronic mail; Itemsets; Relational databases; Statistics; Transaction databases; Upper bound; Association Rule; Correlation; Data Mining; Relational Table; Transactions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
Type :
conf
DOI :
10.1109/ICMLC.2005.1527206
Filename :
1527206
Link To Document :
بازگشت