DocumentCode :
24
Title :
Transductive Multilabel Learning via Label Set Propagation
Author :
Xiangnan Kong ; Ng, Michael K. ; Zhi-Hua Zhou
Author_Institution :
Nat. Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
Volume :
25
Issue :
3
fYear :
2013
fDate :
Mar-13
Firstpage :
704
Lastpage :
719
Abstract :
The problem of multilabel classification has attracted great interest in the last decade, where each instance can be assigned with a set of multiple class labels simultaneously. It has a wide variety of real-world applications, e.g., automatic image annotations and gene function analysis. Current research on multilabel classification focuses on supervised settings which assume existence of large amounts of labeled training data. However, in many applications, the labeling of multilabeled data is extremely expensive and time consuming, while there are often abundant unlabeled data available. In this paper, we study the problem of transductive multilabel learning and propose a novel solution, called Trasductive Multilabel Classification (TraM), to effectively assign a set of multiple labels to each instance. Different from supervised multilabel learning methods, we estimate the label sets of the unlabeled instances effectively by utilizing the information from both labeled and unlabeled data. We first formulate the transductive multilabel learning as an optimization problem of estimating label concept compositions. Then, we derive a closed-form solution to this optimization problem and propose an effective algorithm to assign label sets to the unlabeled instances. Empirical studies on several real-world multilabel learning tasks demonstrate that our TraM method can effectively boost the performance of multilabel classification by using both labeled and unlabeled data.
Keywords :
data mining; learning (artificial intelligence); optimisation; pattern classification; TRAM method; closed-form solution; data mining; label concept composition estimation; label set propagation; labeled training data; multilabel classification problem; multiple class labels; optimization problem; supervised multilabel learning methods; transductive multilabel learning; trasductive multilabel classification; unlabeled data; unlabeled instances; Closed-form solution; Data mining; Learning systems; Machine learning; Optimization; Semisupervised learning; Training data; Data mining; machine learning; multilabel learning; semi-supervised learning; transductive learning; unlabeled data;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2011.141
Filename :
5936063
Link To Document :
بازگشت