Title :
Semisupervised Nonnegative Matrix Factorization for learning the semantics
Author :
Bin Shen ; Datbayev, Z. ; Makhambetov, O.
Author_Institution :
Dept. of Comput. Sci., Purdue Univ., West Lafayette, IN, USA
Abstract :
In real world there are a lot of unlabeled data, and relatively few labeled data. Unlabeled data help to learn a statistical model that can fully describe the global property of data, while labeled data help to minimize the gap between the statistical property and human beings´ perception, i.e. labeled data can help to learn the semantics. Nonnegative Matrix Factorization is a popular technique in data analysis, since a lot of real world data are nonnegative. However, traditional NMF is an unsupervised learning algorithm, which means that it cannot make use of the label information. To enable NMF to make use of both labeled and unlabeled data samples, we propose a novel semisupervised Nonnegative Matrix Factorization technique for learning the semantics. The proposed algorithm extracts prior information from the labeled data, and then uses it to guide the later processing. Experimental results with different settings prove the efficacy of the proposed algorithm.
Keywords :
data analysis; learning (artificial intelligence); matrix decomposition; statistical analysis; data analysis; labeled data; semantics learning; semisupervised nonnegative matrix factorization; statistical model; unlabeled data; unsupervised learning algorithm;
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS) and 13th International Symposium on Advanced Intelligent Systems (ISIS), 2012 Joint 6th International Conference on
Conference_Location :
Kobe
Print_ISBN :
978-1-4673-2742-8
DOI :
10.1109/SCIS-ISIS.2012.6505160