Title :
Alovera: A Fast Stream Processing System for Large-Scale Data
Author :
Zhen´an Zhang ; Dongjie Zhang ; Xiaopeng Yu ; Jing Wang ; Chunjiang He ; Pingpeng Yuan ; Hai Jin
Author_Institution :
HAEPC Electr. Power Res. Inst., Zhengzhou, China
Abstract :
Growing of data volume poses challenges to data processing system. In this paper, Alovera, a fast stream processing system for large-scale data is presented. By using columnar data layout and stream processing, it is capable of pipelining data processing efficiently. It can process part of data instead of waiting for all data to be ready for the next operation. Thus, it can reduce the query time dramatically. Experimental results indicate significant performance improvement in a variety of tasks. In the experiments, we also evaluate our methods with different systems including HadoopDB and Hive. The extensive experiments confirm efficiency and better performance of our system.
Keywords :
data handling; pipeline processing; query processing; Alovera; HadoopDB; Hive; columnar data layout; data processing pipelining; fast stream processing system; large-scale data; Data analysis; Database systems; Engines; Layout; Loading; Optimization; Large-scale data analysis; columnar store; query execution; stream processing;
Conference_Titel :
ChinaGrid Annual Conference (ChinaGrid), 2013 8th
Conference_Location :
Changchun
Print_ISBN :
978-0-7695-5058-9
DOI :
10.1109/ChinaGrid.2013.9