• DocumentCode
    708139
  • Title

    MapReduce implementation for minimum reduct using parallel genetic algorithm

  • Author

    Alshammari, Mashaan A. ; El-Alfy, El-Sayed M.

  • Author_Institution
    Inf. & Comput. Sci. Dept., Univ. of Ha´il, Ha´il, Saudi Arabia
  • fYear
    2015
  • fDate
    7-9 April 2015
  • Firstpage
    13
  • Lastpage
    18
  • Abstract
    Rough set theory (RST) proved to be an effective approach in data mining which can be used successfully for feature/attribute selection and rule induction. Unfortunately, the search space created by RST can be huge and it is important to reduce the search time for the shortest reduct. Genetic algorithm (GA) is one of the metaheuristic algorithms that have been used to tackle this NP-hard optimization problem. However, the effectiveness of the genetic algorithm depends on its implementation. In this work, we introduce a MapReduce approach of a parallel generic algorithm to find the minimum reduct. We evaluated the proposed approach on a number of cybersecurity datasets with varying characteristics. The results showed that the MapReduce approach was more efficient than the sequential approach especially when we go for high dimensions.
  • Keywords
    data mining; feature selection; genetic algorithms; parallel algorithms; rough set theory; search problems; security of data; MapReduce implementation; NP-hard optimization problem; RST; attribute selection; cybersecurity dataset; data mining; feature selection; metaheuristic algorithm; parallel genetic algorithm; rough set theory; rule induction; search space; Communication systems; Genetic algorithms; Mathematical model; Optimization; Set theory; Sociology; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Communication Systems (ICICS), 2015 6th International Conference on
  • Conference_Location
    Amman
  • Type

    conf

  • DOI
    10.1109/IACS.2015.7103194
  • Filename
    7103194