Title :
Strategies for Training Robust Neural Network Based Digit Recognizers on Unbalanced Data Sets
Author :
Vajda, Szilárd ; Fink, Gernot A.
Author_Institution :
Dept. of Comput. Sci., Tech. Univ. Dortmund, Dortmund, Germany
Abstract :
The performance of a neural network in a pattern recognition task may be influenced by several factors. One of these factors is related to the considerable difference between the number of examples belonging to each class to be recognized. The effect called imbalanced data can negatively influence the ability of a recognizer to learn the concept of the minority class. In this work we propose an under-sampling strategy based on selecting samples lying around the decision surface and an over-sampling strategy which uses kernel density estimation to populate the minority class. The experimental results on Roman and Bangla digit data using a neural network based recognizer confirm the effectiveness of the proposed solutions.
Keywords :
digital arithmetic; neural nets; pattern recognition; signal sampling; text analysis; decision surface; digit recognizers; kernel density estimation; neural network; over sampling strategy; pattern recognition; training; unbalanced data sets; under-sampling strategy; digit recognition; kernel density estimation; unbalanced data;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-8353-2
DOI :
10.1109/ICFHR.2010.30