DocumentCode
708139
Title
MapReduce implementation for minimum reduct using parallel genetic algorithm
Author
Alshammari, Mashaan A. ; El-Alfy, El-Sayed M.
Author_Institution
Inf. & Comput. Sci. Dept., Univ. of Ha´il, Ha´il, Saudi Arabia
fYear
2015
fDate
7-9 April 2015
Firstpage
13
Lastpage
18
Abstract
Rough set theory (RST) proved to be an effective approach in data mining which can be used successfully for feature/attribute selection and rule induction. Unfortunately, the search space created by RST can be huge and it is important to reduce the search time for the shortest reduct. Genetic algorithm (GA) is one of the metaheuristic algorithms that have been used to tackle this NP-hard optimization problem. However, the effectiveness of the genetic algorithm depends on its implementation. In this work, we introduce a MapReduce approach of a parallel generic algorithm to find the minimum reduct. We evaluated the proposed approach on a number of cybersecurity datasets with varying characteristics. The results showed that the MapReduce approach was more efficient than the sequential approach especially when we go for high dimensions.
Keywords
data mining; feature selection; genetic algorithms; parallel algorithms; rough set theory; search problems; security of data; MapReduce implementation; NP-hard optimization problem; RST; attribute selection; cybersecurity dataset; data mining; feature selection; metaheuristic algorithm; parallel genetic algorithm; rough set theory; rule induction; search space; Communication systems; Genetic algorithms; Mathematical model; Optimization; Set theory; Sociology; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Systems (ICICS), 2015 6th International Conference on
Conference_Location
Amman
Type
conf
DOI
10.1109/IACS.2015.7103194
Filename
7103194
Link To Document