• DocumentCode
    2457892
  • Title

    Integrating Frequent Pattern Mining from Multiple Data Domains for Classification

  • Author

    Patel, Dhaval ; Hsu, Wynne ; Lee, Mong Li

  • Author_Institution
    Nat. Univ. of Singapore, Singapore, Singapore
  • fYear
    2012
  • fDate
    1-5 April 2012
  • Firstpage
    1001
  • Lastpage
    1012
  • Abstract
    Many frequent pattern mining algorithms have been developed for categorical, numerical, time series, or interval data. However, little attention has been given to integrate these algorithms so as to mine frequent patterns from multiple domain datasets for classification. In this paper, we introduce the notion of a heterogenous pattern to capture the associations among different kinds of data. We propose a unified framework for mining multiple domain datasets and design an iterative algorithm called HTMiner. HTMiner discovers essential heterogenous patterns for classification and performs instance elimination. This instance elimination step reduces the problem size progressively by removing training instances which are correctly covered by the discovered essential heterogenous pattern. Experiments on two real world datasets show that the HTMiner is efficient and can significantly improve the classification accuracy.
  • Keywords
    data mining; iterative methods; pattern classification; HTMiner; categorical data; classification; frequent pattern mining integration; heterogenous pattern discovery; instance elimination step; interval data; iterative algorithm; multiple domain dataset mining; numerical data; time series data; Accuracy; Algorithm design and analysis; Blood pressure; Data mining; Indexes; Itemsets; Time series analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2012 IEEE 28th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4673-0042-1
  • Type

    conf

  • DOI
    10.1109/ICDE.2012.63
  • Filename
    6228151