• DocumentCode
    1642472
  • Title

    Differentiating between individual class performance in Genetic Programming fitness for classification with unbalanced data

  • Author

    Bhowan, Urvesh ; Johnston, Mark ; Zhang, Mengjie

  • Author_Institution
    Victoria Univ. of Wellington, Wellington
  • fYear
    2009
  • Firstpage
    2802
  • Lastpage
    2809
  • Abstract
    This paper investigates improvements to the fitness function in Genetic Programming to better solve binary classification problems with unbalanced data. Data sets are unbalanced when there is a majority of examples for one particular class over the other class(es). We show that using overall classification accuracy as the fitness function evolves classifiers with a performance bias toward the majority class at the expense of minority class performance. We develop four new fitness functions which consider the accuracy of majority and minority class separately to address this learning bias. Results using these fitness functions show that good accuracy for both the minority and majority classes can be achieved from evolved classifiers while keeping overall performance high and balanced across the two classes.
  • Keywords
    genetic algorithms; pattern classification; binary classification problem; data sets; evolved classifiers; fitness function; genetic programming fitness; minority class performance; unbalanced data; Computer science; Data engineering; Error analysis; Genetic programming; Humans; Image recognition; Machine learning; Measurement standards; Medical diagnosis; Sampling methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2009. CEC '09. IEEE Congress on
  • Conference_Location
    Trondheim
  • Print_ISBN
    978-1-4244-2958-5
  • Electronic_ISBN
    978-1-4244-2959-2
  • Type

    conf

  • DOI
    10.1109/CEC.2009.4983294
  • Filename
    4983294