DocumentCode :
54509
Title :
Non-Negative Matrix Factorization with Auxiliary Information on Overlapping Groups
Author :
Shiga, Motoki ; Mamitsuka, Hiroshi
Author_Institution :
Dept. of Electr., Gifu Univ., Gifu, Japan
Volume :
27
Issue :
6
fYear :
2015
fDate :
June 1 2015
Firstpage :
1615
Lastpage :
1628
Abstract :
Matrix factorization is useful to extract the essential low-rank structure from a given matrix and has been paid increasing attention. A typical example is non-negative matrix factorization (NMF), which is one type of unsupervised learning, having been successfully applied to a variety of data including documents, images and gene expression, where their values are usually non-negative. We propose a new model of NMF which is trained by using auxiliary information of overlapping groups. This setting is very reasonable in many applications, a typical example being gene function estimation where functional gene groups are heavily overlapped with each other. To estimate true groups from given overlapping groups efficiently, our model incorporates latent matrices with the regularization term using a mixed norm. This regularization term allows group-wise sparsity on the optimized low-rank structure. The latent matrices and other parameters are efficiently estimated by a block coordinate gradient descent method. We empirically evaluated the performance of our proposed model and algorithm from a variety of viewpoints, comparing with four methods including MMF for auxiliary graph information, by using both synthetic and real world document and gene expression data sets.
Keywords :
genetic algorithms; gradient methods; graph theory; matrix decomposition; performance evaluation; unsupervised learning; NMF; auxiliary graph information; auxiliary information; block coordinate gradient descent method; functional gene group; gene expression data set; gene function estimation; group-wise sparsity; latent matrices; mixed norm; nonnegative matrix factorization; optimized low-rank structure; overlapping group; performance evaluation; real world document; regularization term; synthetic document; unsupervised learning; Data mining; Gene expression; Jacobian matrices; Optimization; Semisupervised learning; Sparse matrices; Vectors; Non-negative matrix factorization; auxiliary information; semi-supervised learning; sparse structured norm;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2014.2373361
Filename :
6965589
Link To Document :
بازگشت