DocumentCode :
3439920
Title :
Mining Discrete Patterns via Binary Matrix Factorization
Author :
Peng Jiang ; Heath, M.
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2013
fDate :
7-10 Dec. 2013
Firstpage :
1129
Lastpage :
1136
Abstract :
In general, binary matrix factorization (BMF) refers to the problem of finding two binary matrices of low rank such that the difference between their matrix product and a given binary matrix is minimal. BMF is an important tool in mining discrete patterns for high-dimensional data. One approximate matrix factor finds the dominant patterns, and the other shows how the original patterns are represented by the dominant ones. The problem of determining the exact optimal solution is NP-hard. We show that BMF is closely related with k-means clustering and propose a clustering approach for BMF. We prove that our approach has approximation ratio of 2. We further propose a randomized clustering algorithm that chooses k cluster centroids randomly based on preassigned probabilities to each point. The randomized clustering algorithm works well for large k. We experimentally demonstrate the nice theoretical properties of BMF on applications in pattern extraction and association rule mining.
Keywords :
data mining; matrix decomposition; optimisation; pattern clustering; probability; randomised algorithms; BMF; NP-hard problem; approximate matrix factor; association rule mining; binary matrix factorization; discrete pattern mining; exact optimal solution; high-dimensional data; k cluster centroids; k-means clustering approach; matrix product; pattern extraction; preassigned probabilities; randomized clustering algorithm; Approximation algorithms; Approximation methods; Clustering algorithms; Data mining; Matrix decomposition; Partitioning algorithms; Vectors; Binary matrix factorization; approximation algorithm; association rule mining; k-means clustering; pattern extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4799-3143-9
Type :
conf
DOI :
10.1109/ICDMW.2013.46
Filename :
6754051
Link To Document :
بازگشت