DocumentCode
3166550
Title
Solving Consensus and Semi-supervised Clustering Problems Using Nonnegative Matrix Factorization
Author
Li, Tao ; Ding, Chris ; Jordan, Michael I.
Author_Institution
Florida Int. Univ., Miami
fYear
2007
fDate
28-31 Oct. 2007
Firstpage
577
Lastpage
582
Abstract
Consensus clustering and semi-supervised clustering are important extensions of the standard clustering paradigm. Consensus clustering (also known as aggregation of clustering) can improve clustering robustness, deal with distributed and heterogeneous data sources and make use of multiple clustering criteria. Semi-supervised clustering can integrate various forms of background knowledge into clustering. In this paper, we show how consensus and semi-supervised clustering can be formulated within the framework of nonnegative matrix factorization (NMF). We show that this framework yields NMF-based algorithms that are: (1) extremely simple to implement; (2) provably correct and provably convergent. We conduct a wide range of comparative experiments that demonstrate the effectiveness of this NMF-based approach.
Keywords
matrix decomposition; pattern clustering; consensus clustering; nonnegative matrix factorization; semisupervised clustering problems; Clustering algorithms; Data analysis; Data mining; Engineering profession; Matrix decomposition; Partitioning algorithms; Robustness; Statistics; USA Councils;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location
Omaha, NE
ISSN
1550-4786
Print_ISBN
978-0-7695-3018-5
Type
conf
DOI
10.1109/ICDM.2007.98
Filename
4470293
Link To Document