Title :
DM-Midware: A Middleware to Enable High Performance Data Mining in Heterogeneous Cloud
Author :
Guoyu Ou ; Ying Liu ; Xinyu Ma ; Cheng Wang
Author_Institution :
Sch. of Comput. & Control, Univ. of Chinese Acad. of Sci., Beijing, China
Abstract :
Cloud computing has become a popular high performance computing model where resources are provided as services over the Web. Users are starting to adopt cloud model in data mining applications. However, due to the complexity of parallel/cloud computing, it is difficult for average users to express a parallel computing paradigm for their applications in cloud. In order to isolate users from the complexity of parallel/cloud programming, a middleware to enable high performance data mining, called DM-Midware, is proposed. It hides the details of MapReduce programming from users by automatically launching mappers through a set of user programming APIs. Directive-based parallelization scheme automatically "translates" a serial program into a SMP or Multi-core based parallel program. Heterogeneous computing resources can be invoked to conduct parallel execution by API-based scheme. A two-step scheduling scheme is proposed to maximize the throughput of the cloud system. We evaluate DM-Midware by executing a representative data mining algorithm in a private cloud. Experimental results demonstrate good scalability and adaptability.
Keywords :
Web services; cloud computing; data mining; middleware; parallel programming; scheduling; DM-Midware; MapReduce programming; SMP; Web service; cloud computing; directive-based parallelization scheme; heterogeneous cloud; heterogeneous computing resources; high performance computing model; middleware; multicore based parallel program; parallel computing; parallel execution; private cloud; representative data mining algorithm; serial program; two-step scheduling scheme; user programming APIs; Algorithm design and analysis; Cloud computing; Data mining; Graphics processing units; Parallel processing; Programming; cloud computing; data mining; parallel computing;
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4799-2902-3
DOI :
10.1109/WI-IAT.2013.152