• DocumentCode
    188536
  • Title

    Multi-label Classification: Dealing with Imbalance by Combining Labels

  • Author

    Ming Fang ; Yuqi Xiao ; Chongjun Wang ; Junyuan Xie

  • Author_Institution
    Nat. Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
  • fYear
    2014
  • fDate
    10-12 Nov. 2014
  • Firstpage
    233
  • Lastpage
    237
  • Abstract
    Data imbalance is a common problem both in single-label classification (SLC) and multi-label classification (MLC). There is no doubt that the predicting result suffers from this problem. Although, a broad range of studies associate with imbalance problem, most of them focus on SLC and for MLC is relatively less. Actually, this problem arising in MLCis more frequent and complex than in SLC. In this paper, we proceed from dealing with imbalance problem for MLC and propose a new approach called DEML. DEML transforms the whole label set of multi-label dataset into some subsets and each subset is treated as a multi-class dataset with balanced class distribution, which not only addressing imbalance problem but also preserving dataset integrity and consistency. Extensive experiments show that DEML possesses highly competitive performance both in computation and effectiveness.
  • Keywords
    pattern classification; DEML approach; MLC; SLC; class distribution; data imbalance; dataset consistency; dataset integrity; multiclass dataset; multilabel classification; single-label classification; Benchmark testing; Complexity theory; Conferences; Data mining; Entropy; Training; Transforms; combining labels; imbalance; multi-label classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2014 IEEE 26th International Conference on
  • Conference_Location
    Limassol
  • ISSN
    1082-3409
  • Type

    conf

  • DOI
    10.1109/ICTAI.2014.42
  • Filename
    6984478