DocumentCode
2727591
Title
Pairwise Constraints-Guided Non-negative Matrix Factorization for Document Clustering
Author
Yang, Yu-Jiu ; Hu, Bao-Gang
fYear
2007
fDate
2-5 Nov. 2007
Firstpage
250
Lastpage
256
Abstract
Nonnegative Matrix Factorization (NMF) has been proven to be effective in text mining. However, since NMF is a well-known unsupervised components analysis technique, the existing NMF method can not deal with prior constraints, which are beneficial to clustering or classification tasks. In this paper, we address the text clustering problem via a novel strategy, called Pairwise Constraintsguided Non-negative Matrix Factorization (PCNMF for short). Differing from the traditional NMF method, the proposed method can capture the available abundance prior constraints in original space, which result in more effective for clustering or information retrieval. Therefore, PCNMF enforces the discriminative capability in the reduced space. Utilizing the appropriate transformation, PCNMF represents as a new optimization problem, which can be efficiently solved by an iterative approach. The cluster membership of each document can be easily determined as the standard NMF. Empirical studies based on Benchmark document corpus demonstrate appealing results.
Keywords
Automation; Computational efficiency; Data mining; Feature extraction; Information retrieval; Internet; Iterative methods; Laboratories; Pattern recognition; Text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, IEEE/WIC/ACM International Conference on
Conference_Location
Fremont, CA
Print_ISBN
978-0-7695-3026-0
Type
conf
DOI
10.1109/WI.2007.66
Filename
4427095
Link To Document