Title :
Dynamic Boolean Matrix Factorizations
Author :
Miettinen, Pauli
Author_Institution :
Max Planck Inst. for Inf., Saarbrucken, Germany
Abstract :
Boolean matrix factorization is a method to decompose a binary matrix into two binary factor matrices. Akin to other matrix factorizations, the factor matrices can be used for various data analysis tasks. Many (if not most) real-world data sets are dynamic, though, meaning that new information is recorded over time. Incorporating this new information into the factorization can require a re-computation of the factorization -- something we cannot do if we want to keep our factorization up-to-date after each update. This paper proposes a method to dynamically update the Boolean matrix factorization when new data is added to the data base. This method is extended with a mechanism to improve the factorization with a trade-off in speed of computation. The method is tested with a number of real-world and synthetic data sets including studying its efficiency against off-line methods. The results show that with good initialization the proposed online and dynamic methods can beat the state-of-the-art offline Boolean matrix factorization algorithms.
Keywords :
Boolean algebra; data analysis; database management systems; matrix decomposition; binary factor matrices; binary matrix; data analysis tasks; dynamic Boolean matrix factorizations; dynamic methods; off-line methods; online methods; real-world data sets; recomputation; state-of-the-art offline Boolean matrix factorization algorithms; synthetic data sets; Algorithm design and analysis; Approximation methods; Data mining; Encoding; Heuristic algorithms; Matrices; Vectors; Boolean matrix factorization; Dynamic algorithms; On-line algorithms;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.118