• DocumentCode
    2295010
  • Title

    An Improved Two-Step Supervised Learning Artificial Neural Network for Imbalanced Dataset Problems

  • Author

    Shamsudin, Hasrul Che ; Adam, Asrul ; Shapiai, Mohd Ibrahim ; Basri, Mohd Ariffanan Mohd ; Ibrahim, Zuwairie ; Khalid, Marzuki

  • Author_Institution
    Fac. of Electr. Eng., Univ. Teknol. Malaysia, Skudai, Malaysia
  • fYear
    2011
  • fDate
    20-22 Sept. 2011
  • Firstpage
    108
  • Lastpage
    113
  • Abstract
    An improved two-step supervised learning algorithm of Artificial Neural Networks (ANN) for imbalanced dataset problems is proposed in this paper. Particle swarm optimization (PSO) is utilized as ANN learning mechanism for first step and second step. The fitness function for both steps is Geometric Mean (G-Mean). Firstly, the best weights on network are determined with a decision threshold is set to 0.5. After the first step learning is accomplished, the best weights will be used for second step learning. The best weights with the best value of decision threshold are obtained and can be used to predict an imbalanced dataset. Haberman´s Survival datasets, which is available in UCI Machine Learning Repository, is chosen as a case study. G-Mean is chosen as the evaluation method to define the classifier´s performance for a case study. Consequently, the proposed approach is able to overcome imbalanced dataset problems with better G-Mean value compared to the previously proposed ANN.
  • Keywords
    data handling; geometry; learning (artificial intelligence); neural nets; particle swarm optimisation; ANN learning mechanism; Haberman survival datasets; UCI machine learning repository; decision threshold; geometric mean; imbalanced dataset problems; particle swarm optimization; two step supervised learning artificial neural network; Artificial neural networks; Classification algorithms; Supervised learning; Testing; Training; artificial neural network; binary classification; imbalanced dataset problems; particle swarm optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence, Modelling and Simulation (CIMSiM), 2011 Third International Conference on
  • Conference_Location
    Langkawi
  • Print_ISBN
    978-1-4577-1797-0
  • Type

    conf

  • DOI
    10.1109/CIMSim.2011.28
  • Filename
    6076341