DocumentCode :
1974480
Title :
MapReduce as a general framework to support research in Mining Software Repositories (MSR)
Author :
Shang, Weiyi ; Jiang, Zhen Ming ; Adams, Bram ; Hassan, Ahmed E.
Author_Institution :
Software Anal. & Intell. Lab. (SAIL), Queen's Univ., Kingston, ON
fYear :
2009
fDate :
16-17 May 2009
Firstpage :
21
Lastpage :
30
Abstract :
Researchers continue to demonstrate the benefits of Mining Software Repositories (MSR) for supporting software development and research activities. However, as the mining process is time- and resource-intensive, they often create their own distributed platforms and use various optimizations to speed up and scale up their analysis. These platforms are project-specific, hard to reuse, and offer minimal debugging and deployment support. In this paper, we propose the use of MapReduce, a distributed computing platform, to support research in MSR. As a proof-of-concept, we migrate J-REX, an optimized evolutionary code extractor, to run on Hadoop, an open source implementation of MapReduce. Through a case study on the source control repositories of the Eclipse, BIRT and Datatools projects, we demonstrate that the migration effort to MapReduce is minimal and that the benefits are significant, as the running time of the migrated J-REX is only 30% to 50% of the original J-REX's. This paper documents our experience with the migration, and highlights the benefits and challenges of the MapReduce framework in the MSR community.
Keywords :
data mining; distributed algorithms; distributed programming; evolutionary computation; public domain software; software engineering; Hadoop open source platform; J-REX optimized evolutionary code extractor; MapReduce framework; distributed computing; mining software repository; software debugging; software development; Data mining; Debugging; Distributed computing; Kernel; Linux; Programming; Scalability; Software performance; Software systems; Workstations;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Mining Software Repositories, 2009. MSR '09. 6th IEEE International Working Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-3493-0
Type :
conf
DOI :
10.1109/MSR.2009.5069477
Filename :
5069477