• DocumentCode
    39773
  • Title

    A Flexible Approach to Finding Representative Pattern Sets

  • Author

    Guimei Liu ; Haojun Zhang ; Limsoon Wong

  • Author_Institution
    Data Analytics Dept., Inst. for Infocomm Res., Singapore, Singapore
  • Volume
    26
  • Issue
    7
  • fYear
    2014
  • fDate
    Jul-14
  • Firstpage
    1562
  • Lastpage
    1574
  • Abstract
    Frequent pattern mining often produces an enormous number of frequent patterns, which imposes a great challenge on visualizing, understanding and further analysis of the generated patterns. This calls for finding a small number of representative patterns to best approximate all other patterns. In this paper, we develop an algorithm called MinRPset to find a minimum representative pattern set with error guarantee. MinRPset produces the smallest solution that we can possibly have in practice under the given problem setting, and it takes a reasonable amount of time to finish when the number of frequent closed patterns is below one million. MinRPset is very space-consuming and time-consuming on some dense datasets when the number of frequent closed patterns is large. To solve this problem, we propose another algorithm called FlexRPset, which provides one extra parameter K to allow users to make a trade-off between result size and efficiency. We adopt an incremental approach to let the users make the trade-off conveniently. Our experiment results show that MinRPset and FlexRPset produce fewer representative patterns than RPlocal-an efficient algorithm that is developed for solving the same problem.
  • Keywords
    data mining; FlexRPset; MinRPset; RPlocal; flexible approach; frequent closed patterns; frequent pattern mining; incremental approach; representative pattern set finding; Approximation algorithms; Approximation methods; Data mining; Generators; Greedy algorithms; Itemsets; Data mining; Database Applications; Database Management; Information Technology and Systems; Representative patterns; frequent pattern summarization; representative patterns;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2013.27
  • Filename
    6427745