• DocumentCode
    243695
  • Title

    Challenges for MapReduce in Big Data

  • Author

    Grolinger, Katarina ; Hayes, Michael ; Higashino, Wilson Akio ; L´Heureux, Alexandra ; Allison, David S. ; Capretz, Miriam A. M.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Western Univ., London, ON, Canada
  • fYear
    2014
  • fDate
    June 27 2014-July 2 2014
  • Firstpage
    182
  • Lastpage
    189
  • Abstract
    In the Big Data community, MapReduce has been seen as one of the key enabling approaches for meeting continuously increasing demands on computing resources imposed by massive data sets. The reason for this is the high scalability of the MapReduce paradigm which allows for massively parallel and distributed execution over a large number of computing nodes. This paper identifies MapReduce issues and challenges in handling Big Data with the objective of providing an overview of the field, facilitating better planning and management of Big Data projects, and identifying opportunities for future research in this field. The identified challenges are grouped into four main categories corresponding to Big Data tasks types: data storage (relational databases and NoSQL stores), Big Data analytics (machine learning and interactive analytics), online processing, and security and privacy. Moreover, current efforts aimed at improving and extending MapReduce to address identified challenges are presented. Consequently, by identifying issues and challenges MapReduce faces when handling Big Data, this study encourages future Big Data research.
  • Keywords
    Big Data; SQL; data analysis; data privacy; learning (artificial intelligence); parallel programming; relational databases; security of data; storage management; Big Data analytics; Big Data community; Big Data project management; Big Data project planning; MapReduce paradigm; NoSQL stores; data privacy; data security; data storage; interactive analytics; machine learning; massive data sets; massively distributed execution; massively parallel execution; online processing; relational databases; Algorithm design and analysis; Big data; Data models; Data visualization; Memory; Scalability; Security; Big Data; Big Data Analytics; Interactive Analytics; Machine Learning; MapReduce; NoSQL; Online Processing; Privacy; Security;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Services (SERVICES), 2014 IEEE World Congress on
  • Conference_Location
    Anchorage, AK
  • Print_ISBN
    978-1-4799-5068-3
  • Type

    conf

  • DOI
    10.1109/SERVICES.2014.41
  • Filename
    6903263