DocumentCode :
607273
Title :
Performance of scalable off-the-shelf hardware for data-intensive parallel processing using MapReduce
Author :
Ahmad Fadzil, Ahmad Firdaus ; Abdul Khalid, Noor Elaiza ; Manaf, Mazani
Author_Institution :
Fac. of Comput. & Math. Sci., Univ. Teknol. MARA (UiTM), Shah Alam, Malaysia
fYear :
2012
fDate :
3-5 Dec. 2012
Firstpage :
379
Lastpage :
384
Abstract :
Large data and information processing requires high processing power that usually involve supercomputers which are costly. MapReduce parallel framework introduces an automated way of distributing these large processes to many computers. This paper proposes to conduct preliminary studies on scalability using MapReduce as an automated parallel processing running on low-cost off-the-shelf hardware. The system architecture is built with collections of off-the-shelf hardware. The scalability test will be conducted by adding an off-the-shelf hardware one at a time to the architecture. MapReduce tool is used as a parallel framework to automatically distribute tasks according to available resources. Performance will be evaluated based on improvement in speedup. It is found that MapReduce is able to accommodate scalability of off-the-shelf hardware resources by automatically distributing tasks regardless of the number of hardware being added to the architecture.
Keywords :
parallel processing; MapReduce tool; automated parallel processing; data-intensive parallel processing; information processing; off-the-shelf hardware scalability; supercomputers; system architecture; MapReduce; Off-the-shelf hardware; Parallel processing; scalability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing and Convergence Technology (ICCCT), 2012 7th International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-0894-6
Type :
conf
Filename :
6530362
Link To Document :
بازگشت