• DocumentCode
    176854
  • Title

    Design of real-time data analysis system based on Impala

  • Author

    Jingmin Li

  • Author_Institution
    Sch. of Comput. Sci., Wuyi Univ., Jiangmen, China
  • fYear
    2014
  • fDate
    29-30 Sept. 2014
  • Firstpage
    934
  • Lastpage
    936
  • Abstract
    With the continuous development of Internet technology, analyzing massive volumes of data efficiently and in real time to extract valuable information has become especially important for enterprises. At present, a relatively common practice is to build the data analysis system in a Hadoop environment based on Hive. However, Hive is better suited to batch processing of large data sets on clusters, and is not suited to the real-time processing of large data that business development and adjustment now require. This paper presents a real-time data analysis system based on Impala, which can serve as a good supplementary scheme. The paper explains the ideas and methods behind the construction of this system in terms of system selection, system architecture, and practice.
  • Keywords
    Big Data; Internet; business data processing; data analysis; public domain software; Big Data; Hadoop environment; Hive; Impala; Internet technology; batch processing; business adjustment; real-time data analysis system design; system architecture; system selection; Batch production systems; Big data; Computer architecture; Conferences; Data analysis; Peer-to-peer computing; Real-time systems; Big data; Hadoop; Impala; Real-time data analysis;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advanced Research and Technology in Industry Applications (WARTIA), 2014 IEEE Workshop on
  • Conference_Location
    Ottawa, ON
  • Type

    conf

  • DOI
    10.1109/WARTIA.2014.6976427
  • Filename
    6976427