DocumentCode
643479
Title
Distributed Methodologies for Imbalanced Classification Problems: Parameter Analysis and Tuning
Author
Lemnaru, Camelia ; Bona, Audrey ; Potolea, Rodica
Author_Institution
Comput. Sci. Dept., Tech. Univ. of Cluj-Napoca, Cluj-Napoca, Romania
fYear
2013
fDate
27-30 June 2013
Firstpage
53
Lastpage
58
Abstract
Imbalanced classification problems represent a current challenge in data mining research, due to the classifiers' inability to produce sufficiently good models in such situations. We have previously proposed a general methodology for improving the performance of classifiers under imbalance conditions: ECSB (Evolutionary Cost-Sensitive Balancing). This paper provides an empirical analysis of a distributed approach for ECSB (dECSB). The influence of the number of splits on the quality of the output classification model is studied on several data sets, with J4.8 as the base classifier. The data sets have been partitioned according to the imbalance ratio and the instances per attributes ratio. We found that the appropriate number of splits is highly dependent on the problem at hand; however, an influence of the two imbalance-related factors is present. The effect of altering the genetic settings has also been investigated, in an attempt to identify several values which consistently yield good results. Again, we found the results to be highly dependent on the problem, with some data sets exhibiting low performance variations due to the genetic settings.
Keywords
data mining; evolutionary computation; pattern classification; J4.8 base classifier; distributed approach for ECSB; evolutionary cost-sensitive balancing; imbalance ratio; imbalanced classification problems; instances per attributes ratio; output classification model quality; parameter analysis; parameter tuning; Classification algorithms; Genetics; Silicon; Sociology; Statistics; Training; Tuning; Imbalanced classification; distributed methodology; empirical analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Parallel and Distributed Computing (ISPDC), 2013 IEEE 12th International Symposium on
Conference_Location
Bucharest
Print_ISBN
978-1-4799-2967-2
Type
conf
DOI
10.1109/ISPDC.2013.16
Filename
6663564