DocumentCode :
657
Title :
Discovering Temporal Change Patterns in the Presence of Taxonomies
Author :
Cagliero, Luca
Author_Institution :
Dipt. di Autom. e Inf., Politec. di Torino, Turin, Italy
Volume :
25
Issue :
3
fYear :
2013
fDate :
Mar-13
Firstpage :
541
Lastpage :
555
Abstract :
Frequent itemset mining is a widely exploratory technique that focuses on discovering recurrent correlations among data. The steadfast evolution of markets and business environments prompts the need of data mining algorithms to discover significant correlation changes in order to reactively suit product and service provision to customer needs. Change mining, in the context of frequent itemsets, focuses on detecting and reporting significant changes in the set of mined itemsets from one time period to another. The discovery of frequent generalized itemsets, i.e., itemsets that 1) frequently occur in the source data, and 2) provide a high-level abstraction of the mined knowledge, issues new challenges in the analysis of itemsets that become rare, and thus are no longer extracted, from a certain point. This paper proposes a novel kind of dynamic pattern, namely the History GENeralized Pattern (HIGEN), that represents the evolution of an itemset in consecutive time periods, by reporting the information about its frequent generalizations characterized by minimal redundancy (i.e., minimum level of abstraction) in case it becomes infrequent in a certain time period. To address HIGEN mining, it proposes HIGEN MINER, an algorithm that focuses on avoiding itemset mining followed by postprocessing by exploiting a support-driven itemset generalization approach. To focus the attention on the minimally redundant frequent generalizations and thus reduce the amount of the generated patterns, the discovery of a smart subset of HIGENs, namely the NONREDUNDANT HIGENs, is addressed as well. Experiments performed on both real and synthetic datasets show the efficiency and the effectiveness of the proposed approach as well as its usefulness in a real application context.
Keywords :
business data processing; data mining; marketing data processing; pattern recognition; HIGEN; business environments; data mining algorithms; discovering temporal change patterns; frequent itemset mining; history generalized pattern; market environments; source data; taxonomies presence; Context awareness; Data mining; Information retrieval; Itemsets; Search methods; Taxonomy; Data mining; mining methods and algorithms;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2011.233
Filename :
6081868
Link To Document :
بازگشت