• DocumentCode
    245080
  • Title

    Fast Algorithms for Frequent Itemset Mining from Uncertain Data

  • Author

    Leung, Carson Kai-Sang ; MacKinnon, Richard Kyle ; Tanbeer, Syed K.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Manitoba, Winnipeg, MB, Canada
  • fYear
    2014
  • fDate
    14-17 Dec. 2014
  • Firstpage
    893
  • Lastpage
    898
  • Abstract
    The majority of existing data mining algorithms mine frequent item sets from precise databases. A well-known algorithm is FP-growth, which builds a compact FP-tree structure to capture important contents of the database and mines frequent item sets from the FP-tree. However, there are situations in which data are uncertain. In recent years, researchers have paid attention to frequent item set mining from uncertain databases. UFP-growth is one of the frequently cited algorithms for mining uncertain data. However, the corresponding UFP-tree structure can be large. Other tree structures for handling uncertain data may achieve compactness at the expense of looser upper bounds on expected supports. To solve this problem, we propose two compact tree structures which capture uncertain data with tighter upper bounds than existing tree structures. We also designed two algorithms that mine frequent item sets from our proposed trees. Our experimental results show the tightness of bounds to expected supports provided by these algorithms.
  • Keywords
    data handling; data mining; tree data structures; FP-growth; UF-growth algorithm; UF-tree structure; compact FP-tree structure; data mining algorithms; fast algorithms; frequent itemset mining; looser upper bounds; tightened upper bounds; uncertain data handling; Algorithm design and analysis; Data mining; Electron tubes; Integrated circuits; Itemsets; Upper bound; Association analysis; data mining algorithms; frequent patterns; tree structures; uncertain data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4799-4303-6
  • Type

    conf

  • DOI
    10.1109/ICDM.2014.146
  • Filename
    7023419