Title :
Design of real-time data analysis system based on Impala
Author_Institution :
Sch. of Comput. Sci., Wuyi Univ., Jiangmen, China
Abstract :
With the continuous development of Internet technology, from a mass of data real-time, efficient analysis and dig out the valuable information, especially important for enterprises. At present, relatively common practice is built up data analysis system in the Hadoop environment based on Hive. But it is more suitable for the batch processing in large data of clusters, and is not suitable for the real-time processing of large data requirements brought about by the development of the business adjustment. This paper presents a real-time data analysis system based on Impala. It can be used as a good supplement scheme. This paper will explain the thought and method of the construction of the real-time data analysis system based on Impala, from the system selection, system architecture, and practical.
Keywords :
Big Data; Internet; business data processing; data analysis; public domain software; Big Data; Hadoop environment; Hive; Impala; Internet technology; batch processing; business adjustment; real-time data analysis system design; system architecture; system selection; Batch production systems; Big data; Computer architecture; Conferences; Data analysis; Peer-to-peer computing; Real-time systems; Big data; Hadoop; Impala; Real-time data analysis;
Conference_Titel :
Advanced Research and Technology in Industry Applications (WARTIA), 2014 IEEE Workshop on
Conference_Location :
Ottawa, ON
DOI :
10.1109/WARTIA.2014.6976427