DocumentCode
176854
Title
Design of real-time data analysis system based on Impala
Author
Jingmin Li
Author_Institution
Sch. of Comput. Sci., Wuyi Univ., Jiangmen, China
fYear
2014
fDate
29-30 Sept. 2014
Firstpage
934
Lastpage
936
Abstract
With the continuous development of Internet technology, from a mass of data real-time, efficient analysis and dig out the valuable information, especially important for enterprises. At present, relatively common practice is built up data analysis system in the Hadoop environment based on Hive. But it is more suitable for the batch processing in large data of clusters, and is not suitable for the real-time processing of large data requirements brought about by the development of the business adjustment. This paper presents a real-time data analysis system based on Impala. It can be used as a good supplement scheme. This paper will explain the thought and method of the construction of the real-time data analysis system based on Impala, from the system selection, system architecture, and practical.
Keywords
Big Data; Internet; business data processing; data analysis; public domain software; Big Data; Hadoop environment; Hive; Impala; Internet technology; batch processing; business adjustment; real-time data analysis system design; system architecture; system selection; Batch production systems; Big data; Computer architecture; Conferences; Data analysis; Peer-to-peer computing; Real-time systems; Big data; Hadoop; Impala; Real-time data analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Research and Technology in Industry Applications (WARTIA), 2014 IEEE Workshop on
Conference_Location
Ottawa, ON
Type
conf
DOI
10.1109/WARTIA.2014.6976427
Filename
6976427
Link To Document