• DocumentCode
    107576
  • Title

    Mandatory Leaf Node Prediction in Hierarchical Multilabel Classification

  • Author

    Wei Bi ; Kwok, James T.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
  • Volume
    25
  • Issue
    12
  • fYear
    2014
  • fDate
    Dec. 2014
  • Firstpage
    2275
  • Lastpage
    2287
  • Abstract
    In hierarchical classification, the output labels reside on a tree- or directed acyclic graph (DAG)-structured hierarchy. On testing, the prediction paths of a given test example may be required to end at leaf nodes of the label hierarchy. This is called mandatory leaf node prediction (MLNP) and is particularly useful, when the leaf nodes have much stronger semantic meaning than the internal nodes. However, while there have been a lot of MLNP methods in hierarchical multiclass classification, performing MLNP in hierarchical multilabel classification is difficult. In this paper, we propose novel MLNP algorithms that consider the global label hierarchy structure. We show that the joint posterior probability over all the node labels can be efficiently maximized by dynamic programming for label trees, or greedy algorithm for label DAGs. In addition, both algorithms can be further extended for the minimization of the expected symmetric loss. Experiments are performed on real-world MLNP data sets with label trees and label DAGs. The proposed method consistently outperforms other hierarchical and flat multilabel classification methods.
  • Keywords
    directed graphs; dynamic programming; greedy algorithms; hierarchical systems; pattern classification; probability; trees (mathematics); DAG-structured hierarchy; MLNP; directed acyclic graph; dynamic programming; greedy algorithm; hierarchical multilabel classification; joint posterior probability; label trees; mandatory leaf node prediction; Algorithm design and analysis; Bismuth; Classification algorithms; Heuristic algorithms; Optimization; Prediction algorithms; Support vector machines; Bayesian decision; hierarchical classification; integer linear program (ILP); multilabel classification; multilabel classification.;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2014.2309437
  • Filename
    6810887