DocumentCode :
692990
Title :
Thump storage: A management and analysis system for structured big data
Author :
Xu Tao ; Fu Ge ; Tan Huaiyuan ; Zhang Hong ; Liu Xinran
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear :
2013
fDate :
20-22 Dec. 2013
Firstpage :
2424
Lastpage :
2427
Abstract :
Structured big data presents a great challenge to the current platform and technology. It is a crossing field of two technological fields, namely the distributed-storage and parallel-processing of massive data, and relation-oriented database. In this paper, we present a management and analysis system for structured big data called ThumpStorage by integrating the bottom distributed structure of Hadoop distributed file system (HDFS) and the partitioning and scheduling technology of the massive parallel processing (MPP) database. This system shows high efficiency, low latency and high scalability. Finally, we test ThumpStorage by single table query, multi table query and concurrent jobs under different cluster node numbers, and compare with the test result of Hive.
Keywords :
Big Data; relational databases; HDFS; Hadoop distributed file system; MPP database; ThumpStorage; analysis system; cluster node numbers; concurrent jobs; distributed storage; distributed structure; management; massive data; massive parallel processing; multitable query; relation-oriented database; scheduling technology; single table query; structured big data; Big data; Distributed databases; Engines; Monitoring; Scalability; Servers; Big Data; HDFS; Hive; MPP; MapReduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mechatronic Sciences, Electric Engineering and Computer (MEC), Proceedings 2013 International Conference on
Conference_Location :
Shengyang
Print_ISBN :
978-1-4799-2564-3
Type :
conf
DOI :
10.1109/MEC.2013.6885442
Filename :
6885442
Link To Document :
بازگشت