DocumentCode :
708139
Title :
MapReduce implementation for minimum reduct using parallel genetic algorithm
Author :
Alshammari, Mashaan A. ; El-Alfy, El-Sayed M.
Author_Institution :
Inf. & Comput. Sci. Dept., Univ. of Ha´il, Ha´il, Saudi Arabia
fYear :
2015
fDate :
7-9 April 2015
Firstpage :
13
Lastpage :
18
Abstract :
Rough set theory (RST) proved to be an effective approach in data mining which can be used successfully for feature/attribute selection and rule induction. Unfortunately, the search space created by RST can be huge and it is important to reduce the search time for the shortest reduct. Genetic algorithm (GA) is one of the metaheuristic algorithms that have been used to tackle this NP-hard optimization problem. However, the effectiveness of the genetic algorithm depends on its implementation. In this work, we introduce a MapReduce approach of a parallel generic algorithm to find the minimum reduct. We evaluated the proposed approach on a number of cybersecurity datasets with varying characteristics. The results showed that the MapReduce approach was more efficient than the sequential approach especially when we go for high dimensions.
Keywords :
data mining; feature selection; genetic algorithms; parallel algorithms; rough set theory; search problems; security of data; MapReduce implementation; NP-hard optimization problem; RST; attribute selection; cybersecurity dataset; data mining; feature selection; metaheuristic algorithm; parallel genetic algorithm; rough set theory; rule induction; search space; Communication systems; Genetic algorithms; Mathematical model; Optimization; Set theory; Sociology; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Systems (ICICS), 2015 6th International Conference on
Conference_Location :
Amman
Type :
conf
DOI :
10.1109/IACS.2015.7103194
Filename :
7103194
Link To Document :
بازگشت