• DocumentCode
    2504641
  • Title

    PLT- Positional Lexicographic Tree: A New Structure for Mining Frequent Itemsets

  • Author

    Boukerche, Azzedine ; Samarah, Samer

  • Author_Institution
    Sch. of Inf. Technol. & Eng., Ottawa Univ., Ont.
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    135
  • Lastpage
    141
  • Abstract
    Association rules have proved their influence in different industrial fields, where their goal is to identify the relations existing among the events that are stored in large databases. However, in order to enumerate the association rules, there is a need to identify the frequent set of itemsets (i.e. those events that occur together in a sufficient number of transactions). In this paper, a new representation structure for the data stored in any transactional database is proposed. This structure, which we refer to as positional lexicographic tree (PLT), provides an efficient mechanism for subset checking based on a summary of the data extracted from the database. This makes PLT a promising tool for most of the existing data mining approaches. Moreover, our proposed PLT structure regulates the data in the database so that they can be applicable to compression and indexing techniques, which makes PLT suitable for supporting large databases. First, we introduce the PLT construction process, then highlight the different mining approaches that can be modulated to take advantage of PLT. We then present our algorithm and finally prove its correctness
  • Keywords
    data mining; database indexing; tree data structures; association rules; data extraction; data storage; frequent itemsets mining; large databases; positional lexicographic tree; Association rules; Data engineering; Data mining; Frequency; Indexing; Industrial relations; Information technology; Itemsets; Modular construction; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing Workshops, 2006. ICPP 2006 Workshops. 2006 International Conference on
  • Conference_Location
    Columbus, OH
  • ISSN
    1530-2016
  • Print_ISBN
    0-7695-2637-3
  • Type

    conf

  • DOI
    10.1109/ICPPW.2006.63
  • Filename
    1690694