Title :
LaCova: A Tree-Based Multi-label Classifier Using Label Covariance as Splitting Criterion
Author :
Al-Otaibi, Reem ; Kull, Meelis ; Flach, Peter
Author_Institution :
Intell. Syst. Lab., Univ. of Bristol, Bristol, UK
Abstract :
Dealing with multiple labels is a supervised learning problem of increasing importance. Multi-label classifiers face the challenge of exploiting correlations between labels. While in existing work these correlations are often modelled globally, in this paper we use the divide-and-conquer approach of decision trees which enables taking local decisions about how best to model label dependency. The resulting algorithm establishes a tree-based multi-label classifier called LaCova which dynamically interpolates between two well-known baseline methods: Binary Relevance, which assumes all labels independent, and Label Power set, which learns the joint label distribution. The key idea is a splitting criterion based on the label covariance matrix at that node, which allows us to choose between a horizontal split (branching on a feature) and a vertical split (separating the labels). Empirical results on 12 data sets show strong performance of the proposed method, particularly on data sets with hundreds of labels.
Keywords :
covariance matrices; decision trees; divide and conquer methods; learning (artificial intelligence); pattern classification; LaCova; binary relevance; decision trees; divide-and-conquer approach; horizontal split; joint label distribution; label covariance matrix; label dependency; label powerset; local decisions; splitting criterion; supervised learning problem; tree-based multilabel classifier; vertical split; Accuracy; Correlation; Covariance matrices; Decision trees; Educational institutions; Loss measurement; Prediction algorithms; covariance matrix; decision trees; multi-label learning; splitting criteria;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2014 13th International Conference on
Conference_Location :
Detroit, MI
DOI :
10.1109/ICMLA.2014.17