Title :
Adapting CakeDB to Integrate High-Pressure Big Data Streams with Low-Pressure Systems
Author :
Membrey, Peter ; Chan, Keith C. C. ; Demchenko, Y.
Author_Institution :
Dept. of Comput., Hong Kong Polytech. Univ., Hong Kong, China
Abstract :
Big Data continues to be one of the hottest topics in the computer science field and itself takes many forms. One way Big Data manifests is in the form of streams. These streams can be generally defined by their update frequency and the bandwidth they consume. They can however be further defined by the characteristics of the data they carry. The producers of these streams are generally tuned to perform a given role (such as moving large quantities of data with low latency) which can often be at odds with the requirements of a given consumer. In many cases the logistics of consuming such a stream can make the task impractical. This paper discusses the concept of data streams as sequential data sets and having different pressures. The paper demonstrates through a use case of a financial trading company and a High Performance Compute Cluster how different applications require different pressures and why it is necessary to be able to scale down high pressure streams for low pressure applications without impacting the applications that require the full high pressure feed and the high pressure feed itself. A proposed system for classifying streams and related consumers is discussed as well as the concept of conflation as it applies to these data streams. Features in the prototype stream oriented database (CakeDB) that support adapting high-pressure streams to low-pressure applications are then discussed and further work is identified.
Keywords :
Big Data; financial data processing; CakeDB; financial trading company; high performance compute cluster; high-pressure Big Data streams; high-pressure streams; logistics; low-pressure systems; prototype stream oriented database; sequential data sets; Companies; Feeds; Indexes; Real-time systems; Throughput; big data; cakedb; erlang; high pressure data streams; low latency data streams;
Conference_Titel :
Cloud Computing and Big Data (CloudCom-Asia), 2013 International Conference on
Conference_Location :
Fuzhou
Print_ISBN :
978-1-4799-2829-3
DOI :
10.1109/CLOUDCOM-ASIA.2013.33