• DocumentCode
    2984811
  • Title

    Dynamic Boolean Matrix Factorizations

  • Author

    Miettinen, Pauli

  • Author_Institution
    Max Planck Inst. for Inf., Saarbrucken, Germany
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    519
  • Lastpage
    528
  • Abstract
    Boolean matrix factorization is a method to decompose a binary matrix into two binary factor matrices. Akin to other matrix factorizations, the factor matrices can be used for various data analysis tasks. Many (if not most) real-world data sets are dynamic, though, meaning that new information is recorded over time. Incorporating this new information into the factorization can require a re-computation of the factorization -- something we cannot do if we want to keep our factorization up-to-date after each update. This paper proposes a method to dynamically update the Boolean matrix factorization when new data is added to the data base. This method is extended with a mechanism to improve the factorization with a trade-off in speed of computation. The method is tested with a number of real-world and synthetic data sets including studying its efficiency against off-line methods. The results show that with good initialization the proposed online and dynamic methods can beat the state-of-the-art offline Boolean matrix factorization algorithms.
  • Keywords
    Boolean algebra; data analysis; database management systems; matrix decomposition; binary factor matrices; binary matrix; data analysis tasks; dynamic Boolean matrix factorizations; dynamic methods; off-line methods; online methods; real-world data sets; recomputation; state-of-the-art offline Boolean matrix factorization algorithms; synthetic data sets; Algorithm design and analysis; Approximation methods; Data mining; Encoding; Heuristic algorithms; Matrices; Vectors; Boolean matrix factorization; Dynamic algorithms; On-line algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4673-4649-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2012.118
  • Filename
    6413874