DocumentCode :
643479
Title :
Distributed Methodologies for Imbalanced Classification Problems: Parameter Analysis and Tuning
Author :
Lemnaru, Camelia ; Bona, Audrey ; Potolea, Rodica
Author_Institution :
Comput. Sci. Dept., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear :
2013
fDate :
27-30 June 2013
Firstpage :
53
Lastpage :
58
Abstract :
Imbalanced classification problems represent a current challenge in data mining research, due to the classifiers´ inability to produce sufficiently good models in such situations. We have previously proposed a general methodology for improving the performance of classifiers under imbalance conditions: ECSB -- Evolutionary Cost-Sensitive Balancing. This paper provides an empirical analysis on a distributed approach for ECSB (dECSB). The influence of the number of splits on the quality of the output classification model is studied on several data sets and J4.8 as base classifier. The data sets have been partitioned according to the imbalance ratio and the instances per attributes ratio. We found that the appropriate number of splits is highly dependent on the problem at hand, however, an influence of the two imbalance-related factors is present. The effect of altering the genetic settings has also been investigated, in the attempt to identify several values which constantly yield good results. Again, we found the results to be highly dependent on the problem, with some data sets exhibiting low performance variations due to the genetic settings.
Keywords :
data mining; evolutionary computation; pattern classification; J4.8 base classifier; data mining; distributed approach for ECSB; evolutionary cost-sensitive balancing; imbalance ratio; imbalanced classification problems; instances per attributes ratio; output classification model quality; parameter analysis; parameter tuning; Classification algorithms; Genetics; Silicon; Sociology; Statistics; Training; Tuning; Imbalanced classification; distributed methodology; empirical analysis; parameter tuning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing (ISPDC), 2013 IEEE 12th International Symposium on
Conference_Location :
Bucharest
Print_ISBN :
978-1-4799-2967-2
Type :
conf
DOI :
10.1109/ISPDC.2013.16
Filename :
6663564
Link To Document :
بازگشت