Title :
FARM: a framework for exploring mining spaces with multiple attributes
Author :
Perng, Chang-Shing ; Wang, Haixun ; Ma, Sheng ; Hellerstein, Joseph L.
Author_Institution :
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Abstract :
Mining for frequent itemsets typically involves a preprocessing step in which data with multiple attributes are grouped into transactions, and items are defined based on attribute values. We hake observed that such fixed attribute mining can severely constrain the patterns that are discovered. Herein, we introduce mining spaces, a new framework for mining multi-attribute data that includes the discovery of transaction and item definitions (with the exploitation of taxonomies and functional dependencies if they are available). We prove that special downward closure properties (or anti-monotonic property) hold for mining spaces, a result that allows us to construct efficient algorithms for mining patterns without the constraints of fixed attribute mining. We apply our algorithms to real world data collected from a production computer network. The results show that by exploiting the special kinds of downward closure in mining spaces, execution times for mining can be reduced by a factor of three to four
Keywords :
data mining; transaction processing; FARM; downward closure properties; efficient algorithms; fixed attribute mining; frequent itemset mining; item definition discovery; mining space exploration; multiple attributes; production computer network; transaction definition discovery; transactions; Complex networks; Computer network management; Computer networks; Data mining; Itemsets; Pediatrics; Production; Space exploration; Taxonomy;
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989551