Title :
Data Partition Optimization for Column-Family NoSQL Databases
Author :
Li-Yung Ho;Meng-Ju Hsieh;Jan-Jan Wu;Pangfeng Liu
Author_Institution :
Dept. of Comput. Sci. &
Abstract :
Data conversion has become an emerging topic in the Big Data era. To cope with rapid data growth, legacy or existing relational databases need to be converted into NoSQL column-family databases in order to achieve better scalability. The conversion from SQL to NoSQL databases requires combining small, normalized SQL data tables into larger NoSQL data tables, a process called denormalization. A challenging issue in data conversion is how to group the denormalized columns of a large data table into "families" so as to ensure good query-processing performance. In this paper, we propose an efficient heuristic algorithm, GPA (Graph-based Partition Algorithm), to address this problem. We use the TPC-C and TPC-H benchmarks to demonstrate that the column families produced by GPA are very efficient for large-scale data processing.
Keywords :
"Partitioning algorithms","Relational databases","Scalability","Heuristic algorithms","Data models","Compaction"
Conference_Title :
Smart City/SocialCom/SustainCom (SmartCity), 2015 IEEE International Conference on
DOI :
10.1109/SmartCity.2015.146