• DocumentCode
    3013131
  • Title

    MAFIA: a maximal frequent itemset algorithm for transactional databases

  • Author

    Burdick, Doug ; Calimlim, Manuel ; Gehrke, Johannes

  • Author_Institution
    Dept. of Comput. Sci., Cornell Univ., Ithaca, NY, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    443
  • Lastpage
    452
  • Abstract
    We present a new algorithm for mining maximal frequent itemsets from a transactional database. Our algorithm is especially efficient when the itemsets in the database are very long. The search strategy of our algorithm integrates a depth-first traversal of the itemset lattice with effective pruning mechanisms. Our implementation of the search strategy combines a vertical bitmap representation of the database with an efficient relative bitmap compression schema. In a thorough experimental analysis of our algorithm on real data, we isolate the effect of the individual components of the algorithm. Our performance numbers show that our algorithm outperforms previous work by a factor of three to five
  • Keywords
    data compression; data mining; database theory; transaction processing; tree searching; MAFIA; depth-first traversal; experimental analysis; itemset lattice; maximal frequent itemset algorithm; maximal frequent itemset mining; pruning mechanisms; real data; relative bitmap compression schema; search strategy; transactional database; transactional databases; vertical bitmap representation; Algorithm design and analysis; Association rules; Computer science; Data mining; Itemsets; Lattices; Pattern analysis; Performance analysis; Transaction databases; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2001. Proceedings. 17th International Conference on
  • Conference_Location
    Heidelberg
  • ISSN
    1063-6382
  • Print_ISBN
    0-7695-1001-9
  • Type

    conf

  • DOI
    10.1109/ICDE.2001.914857
  • Filename
    914857