Title :
Enhancing Accuracy of Multilabel Classification by Extracting Hierarchies
Author :
Ulanov, Alexander ; Sapozhnikov, German ; Lyubomishchenko, Nickolay ; Polutin, Vladimir ; Shevlyakov, Georgy
Author_Institution :
HP Labs. Russia, St. Petersburg, Russia
fDate :
Aug. 29 2011-Sept. 2 2011
Abstract :
A novel algorithm of extracting hierarchies with the maximal F-measure for improving multilabel classification performance, the PHOCS, builds Predicted Hierarchy Of Classifiers. Nodes contain classifiers, and each intermediate node corresponds to a set of labels, and a leaf node to a single label. Any classifier in the extracted hierarchy deals with a considerably smaller set of labels as compared to the number L of labels, and with a more balanced training distribution. This leads to an improved classification performance. Our method has linear training and logarithmic testing complexity with respect to L. The experiment was conducted on 4 multilabel datasets and it has confirmed the effectiveness of the PHOCS algorithm.
Keywords :
data mining; pattern classification; PHOCS algorithm; hierarchy extraction; linear training; logarithmic testing complexity; maximal F-measure; multilabel classification; Accuracy; Buildings; Clustering algorithms; Complexity theory; Manganese; Prediction algorithms; Training; hierarchy extraction; multilabel classification; taxonomy generation; text mining;
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2011 22nd International Workshop on
Conference_Location :
Toulouse
Print_ISBN :
978-1-4577-0982-1
DOI :
10.1109/DEXA.2011.29