• DocumentCode
    1798123
  • Title

    An algorithmic framework based on the binarization approach for supervised and semi-supervised multiclass problems

  • Author

    Sen, Arunabha ; Islam, Md Minarul ; Murase, K.

  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    175
  • Lastpage
    182
  • Abstract
    Using a set of binary classifiers to solve the multiclass classification problem has been a popular approach over the years. This technique is known as binarization. The decision boundary that these binary classifiers (also called base classifiers) have to learn is much simpler than the decision boundary of a multiclass classifier. However, binarization gives rise to a new problem, the class imbalance problem, which occurs when the data set used for training has relatively fewer data items for one class than for another. This problem becomes more severe if the original data set was itself imbalanced. Furthermore, binarization has only been implemented in the domain of supervised classification. In this paper, we propose a framework called Binarization with Boosting and Oversampling (BBO). Our framework can handle the class imbalance problem arising from binarization. As its name suggests, this is achieved through a combination of boosting and oversampling. The BBO framework can be used with any supervised classification algorithm. Moreover, unlike earlier binarization approaches, we apply our framework to semi-supervised classification as well. The BBO framework has been rigorously tested on a number of benchmark data sets from the UCI Machine Learning Repository. The experimental results show that the BBO framework achieves higher accuracy than the traditional binarization approach.
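The abstract's point that binarization itself creates class imbalance can be illustrated with a minimal sketch. This is not the BBO algorithm, whose details the abstract does not give: it assumes a one-vs-rest binarization scheme and plain random oversampling, and the helper names `binarize_one_vs_rest` and `random_oversample` are hypothetical, chosen only for this illustration.

```python
import random
from collections import Counter

def binarize_one_vs_rest(labels, positive_class):
    """Relabel multiclass labels as 1 (positive_class) vs 0 (all other classes)."""
    return [1 if y == positive_class else 0 for y in labels]

def random_oversample(items, binary_labels, seed=0):
    """Duplicate randomly chosen minority items until both classes have equal size.

    Assumption: simple random oversampling with replacement, used here only
    as a stand-in for whatever oversampling scheme the paper employs.
    """
    rng = random.Random(seed)
    counts = Counter(binary_labels)
    minority = min(counts, key=counts.get)
    majority = max(counts, key=counts.get)
    deficit = counts[majority] - counts[minority]
    minority_items = [x for x, y in zip(items, binary_labels) if y == minority]
    extra = [rng.choice(minority_items) for _ in range(deficit)]
    return list(items) + extra, list(binary_labels) + [minority] * deficit

# A perfectly balanced three-class toy data set: 4 items per class.
labels = ["a"] * 4 + ["b"] * 4 + ["c"] * 4
items = list(range(len(labels)))

# Binarizing for class "a" turns a balanced problem into a 4-vs-8 problem.
binary = binarize_one_vs_rest(labels, "a")
before = Counter(binary)

# Oversampling the minority class restores an 8-vs-8 split.
balanced_items, balanced_labels = random_oversample(items, binary)
after = Counter(balanced_labels)
```

Even though the original three classes are equal in size, each one-vs-rest base classifier sees twice as many negatives as positives, and the gap widens as the number of classes grows. The abstract states that BBO additionally uses boosting, which this sketch does not attempt to reproduce.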
  • Keywords
    learning (artificial intelligence); pattern classification; BBO; base classifier; binarization approach; binarization with boosting and oversampling; binary classifier; class imbalance problem; decision boundary; machine learning repository; multiclass classification problem; semisupervised multiclass problem; supervised classification algorithm; Accuracy; Artificial neural networks; Boosting; Partitioning algorithms; Training; Training data;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Neural Networks (IJCNN), 2014 International Joint Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6627-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2014.6889793
  • Filename
    6889793