• DocumentCode
    2229647
  • Title

    What-If Query Processing Policy for Big Data in OLAP System

  • Author

    Huan Xu ; Hao Luo ; Jieyue He

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
  • fYear
    2013
  • fDate
    13-15 Dec. 2013
  • Firstpage
    110
  • Lastpage
    116
  • Abstract
    What-if analysis focuses on analysis on hypothetical scenarios based on historical data. Therefore, it can provide more meaningful information than classical OLAP (on-line analysis processing) for the users of decision support system. As big data OLAP systems are always based on the computation model of MapReduce, of which the advantage is to handle large data sets in batch-processing mode, however it is not suitable for real-time response requirements. It is a most key step to merge delta-table in the process of what-if. However, classical delta-table merge algorithms are seriously restricted in time and space. Multi-Scenario hypothesis, which is upon historical data in big data analytical processing, needs efficient what-if data view support. Therefore, two novel algorithms based on Bloom filter and distributed cache, which can significantly improve the performance of delta table merging algorithm, are proposed in this paper. Finally, compared with Hive on standard SSB data set, our algorithm, which is based on Bloom filter, is demonstrated to be 30% faster. In the case of smaller delta table, even more improvements can be achieved by the algorithm based on distributed cache.
  • Keywords
    Big Data; cache storage; data mining; data structures; decision support systems; distributed processing; merging; query processing; Bloom filter; Hive; MapReduce computation model; OLAP system; batch-processing mode; big data analytical processing; classical delta-table merge algorithms; decision support system; distributed cache; historical data; multiscenario hypothesis; online analysis processing; standard SSB data set; what-if analysis; what-if query processing policy; Algorithm design and analysis; Data warehouses; Filtering algorithms; Matched filters; Merging; Bloom Filter; Delta Table; Hadoop; MapReduce; OLAP; What-if Analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Cloud and Big Data (CBD), 2013 International Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4799-3260-3
  • Type

    conf

  • DOI
    10.1109/CBD.2013.40
  • Filename
    6824582