Regional Information Center for Science and Technology - The best of two worlds: Integrating IBM InfoSphere Streams with Apache YARN

DocumentCode :

1791764

Title :

The best of two worlds: Integrating IBM InfoSphere Streams with Apache YARN

Author :

Nabi, Zubair ; Wagle, Rohit ; Bouillet, Eric

Author_Institution :

IBM Research - Ireland, Dublin, Ireland

fYear :

2014

fDate :

27-30 Oct. 2014

Firstpage :

Lastpage :

Abstract :

The seamless confluence of data in motion and data at rest has the potential to redefine the Big Data analytics landscape in a diverse range of domains. To make this happen, existing data intensive computing frameworks need to be repurposed and integrated at control, data, and management levels. Towards this end, we present the system level integration of IBM InfoSphere Streams with Apache YARN. Our design leverages the key differentiating features of the two frameworks to blend high throughput batch-processing with near line-rate, low latency stream-processing. In addition, both frameworks are able to share resources and offer the same interfaces that their users are accustomed to. Using two real-world examples, we illustrate how such a system can be used in production.

Keywords :

Big Data; data analysis; user interfaces; Apache YARN; Big Data analytics; IBM InfoSphere Streams; batch-processing; data intensive computing frameworks; stream-processing; Computer architecture; Containers; Libraries; Real-time systems; Resource management; Yarn; cluster management

fLanguage :

English

Publisher :

IEEE

Conference_Title :

2014 IEEE International Conference on Big Data (Big Data)

Conference_Location :

Washington, DC

Type :

conf

DOI :

10.1109/BigData.2014.7004443

Filename :

7004443

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1791764