DocumentCode :
2457892
Title :
Integrating Frequent Pattern Mining from Multiple Data Domains for Classification
Author :
Patel, Dhaval ; Hsu, Wynne ; Lee, Mong Li
Author_Institution :
Nat. Univ. of Singapore, Singapore, Singapore
fYear :
2012
fDate :
1-5 April 2012
Firstpage :
1001
Lastpage :
1012
Abstract :
Many frequent pattern mining algorithms have been developed for categorical, numerical, time series, or interval data. However, little attention has been given to integrate these algorithms so as to mine frequent patterns from multiple domain datasets for classification. In this paper, we introduce the notion of a heterogenous pattern to capture the associations among different kinds of data. We propose a unified framework for mining multiple domain datasets and design an iterative algorithm called HTMiner. HTMiner discovers essential heterogenous patterns for classification and performs instance elimination. This instance elimination step reduces the problem size progressively by removing training instances which are correctly covered by the discovered essential heterogenous pattern. Experiments on two real world datasets show that the HTMiner is efficient and can significantly improve the classification accuracy.
Keywords :
data mining; iterative methods; pattern classification; HTMiner; categorical data; classification; frequent pattern mining integration; heterogenous pattern discovery; instance elimination step; interval data; iterative algorithm; multiple domain dataset mining; numerical data; time series data; Accuracy; Algorithm design and analysis; Blood pressure; Data mining; Indexes; Itemsets; Time series analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2012 IEEE 28th International Conference on
Conference_Location :
Washington, DC
ISSN :
1063-6382
Print_ISBN :
978-1-4673-0042-1
Type :
conf
DOI :
10.1109/ICDE.2012.63
Filename :
6228151
Link To Document :
بازگشت