DocumentCode :
3316257
Title :
Performance analysis of a fault-tolerant exact motif mining algorithm on the cloud
Author :
Nhan Nguyen ; Khan, Mohammad Maifi Hasan
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Connecticut, Storrs, CT, USA
fYear :
2013
fDate :
6-8 Dec. 2013
Firstpage :
1
Lastpage :
9
Abstract :
In this paper, we present the performance analysis and design challenges of implementing a fault-tolerant parallel exact motif mining algorithm leveraging the services provided by the underlying cloud storage platform (e.g., data replication, node failure detection). More specifically, first, we present the design of the intermediate data structures and data models that are needed for effective parallelization of the motif mining algorithm on the cloud. Second, we present the design and implementation of a fault-tolerant parallel motif mining algorithm that enables the data analytic system to recover from arbitrary node failures in the cloud environment by detecting node failures and redistributing remaining computational tasks in real-time. We also present a data caching scheme to improve the system performance even further. We evaluated the impact of various factors such as the replication factor and random node failures on the performance of our system using two different datasets, namely, an EOG dataset and an image dataset. In both cases, our algorithm exhibits superior performance over the existing algorithms, thus demonstrating the effectiveness of our presented system.
Keywords :
cache storage; cloud computing; data analysis; data models; software fault tolerance; time series; EOG dataset; arbitrary node failures detection; cloud environment; cloud storage platform; data analytic system; data caching scheme; data models; design challenges; fault-tolerant parallel exact motif mining algorithm; image dataset; intermediate data structures design; parallelization; random node failures; replication factor; system performance analysis; time series; Algorithm design and analysis; Cloud computing; Data mining; Fault tolerance; Fault tolerant systems; Sensors; Time series analysis; cloud; motif; time series;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Performance Computing and Communications Conference (IPCCC), 2013 IEEE 32nd International
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4799-3213-9
Type :
conf
DOI :
10.1109/PCCC.2013.6742786
Filename :
6742786
Link To Document :
بازگشت