• DocumentCode
    51610
  • Title

    LI-MLC: A Label Inference Methodology for Addressing High Dimensionality in the Label Space for Multilabel Classification

  • Author

    Charte, Francisco ; Rivera, Antonio J. ; del Jesus, Maria J. ; Herrera, Francisco

  • Author_Institution
    Dept. of Comput. Sci. & Artificial Intell., Univ. of Granada, Granada, Spain
  • Volume
    25
  • Issue
    10
  • fYear
    2014
  • fDate
    Oct. 2014
  • Firstpage
    1842
  • Lastpage
    1854
  • Abstract
    Multilabel classification (MLC) has generated considerable research interest in recent years, as a technique that can be applied to many real-world scenarios. To process them with binary or multiclass classifiers, methods for transforming multilabel data sets (MLDs) have been proposed, as well as adapted algorithms able to work with this type of data sets. However, until now, few studies have addressed the problem of how to deal with MLDs having a large number of labels. This characteristic can be defined as high dimensionality in the label space (output attributes), in contrast to the traditional high dimensionality problem, which is usually focused on the feature space (by means of feature selection) or sample space (by means of instance selection). The purpose of this paper is to analyze dimensionality in the label space in MLDs, and to present a transformation methodology based on the use of association rules to discover label dependencies. These dependencies are used to reduce the label space, to ease the work of any MLC algorithm, and to infer the deleted labels in a final postprocessing stage. The proposed process is validated in an extensive experimentation with several MLDs and classification algorithms, resulting in a statistically significant improvement of performance in some cases, as will be shown.
  • Keywords
    data mining; inference mechanisms; pattern classification; LI-MLC; MLD; association rules; high label space dimensionality; label dependencies discovery; label inference for MLC; label inference methodology; multilabel classification; multilabel data sets; transformation methodology; Algorithm design and analysis; Correlation; Inference algorithms; Partitioning algorithms; Prediction algorithms; Proposals; Training; Association rules (ARs); data transformation; dimensionality reduction; multilabel classification (MLC); multilabel classification (MLC).;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2013.2296501
  • Filename
    6704769