DocumentCode :
249317
Title :
XDB - A Novel Database Architecture for Data Analytics as a Service
Author :
Binnig, Carsten ; Salama, Abdallah ; Zamanian, Erfan ; Kornmayer, Harald ; Listing, Sven ; Mueller, Alexander C.
Author_Institution :
Baden-Wuerttemberg Cooperative State Univ. Mannheim, Mannheim, Germany
fYear :
2014
fDate :
June 27 2014-July 2 2014
Firstpage :
96
Lastpage :
103
Abstract :
Parallel shared-nothing database systems are major platforms for efficiently analyzing large amounts of structured data. However, in order to offer SQL-like services for data analytics in the cloud, providers such as Amazon and Google do not use these systems as a basis. A major reason for this trend is that existing parallel shared-nothing database systems are expensive and that they do not fulfill many of the requirements such as elasticity and fault-tolerance needed for providing a service for data analytics in the cloud. In this paper, we present an overview of an elastic and fault-tolerant database system called XDB, which supports complex analytics. XDB builds on the following novel concepts: (1) a partitioning scheme that supports elasticity with regard to data and queries, (2) a cost-based fault-tolerance scheme that allows to recover from mid-query faults, and (3) adaptive parallelization techniques to better support complex analytical queries. XDB is implemented using a middleware approach on top of multiple nodes each hosting an instance of a single node database system (MySQL in our prototype). Initial experiments show that our novel concepts effectively support elasticity, fault-tolerance and complex analytics when compared to the traditional behavior of existing databases.
Keywords :
SQL; cloud computing; middleware; parallel databases; query processing; software architecture; MySQL; SQL-like services; XDB; adaptive parallelization techniques; cloud computing; complex analytical queries; complex data analytics; cost-based fault-tolerance scheme; database architecture; elastic fault-tolerant database; mid-query fault recovery; middleware approach; multiple nodes; parallel shared-nothing database systems; partitioning scheme; query processing; single-node database system; structured data analysis; Elasticity; Fault tolerance; Fault tolerant systems; Middleware; Query processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (BigData Congress), 2014 IEEE International Congress on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4799-5056-0
Type :
conf
DOI :
10.1109/BigData.Congress.2014.23
Filename :
6906766
Link To Document :
بازگشت