• DocumentCode
    257340
  • Title

    Hippo: An enhancement of pipeline-aware in-memory caching for HDFS

  • Author

    Lan Wei ; Wenbo Lian ; Kuien Liu ; Yongji Wang

  • Author_Institution
    Inst. of Software, Beijing, China
  • fYear
    2014
  • fDate
    4-7 Aug. 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    In the age of big data, distributed computing frameworks tend to coexist and collaborate in pipeline using one scheduler. While a variety of techniques for reducing I/O latency have been proposed, these are rarely specific for the whole pipeline performance. This paper proposes memory management logic called “Hippo” which targets distributed systems and in particular “pipelined” applications that might span differing big data frameworks. Though individual frameworks may have internal memory management primitives, Hippo proposes to make a generic framework that works agnostic of these highlevel operations. To increase the hit ratio of in-memory cache, this paper discusses the granularity of caching and how Hippo leverages the job dependency graph to make memory retention and pre-fetching decisions. Our evaluations demonstrate that job dependency is essential to improve the cache performance and a global cache policy maker, in most cases, significantly outperforms explicit caching by users.
  • Keywords
    Big Data; cache storage; pipeline processing; HDFS; Hippo; I/O latency reducing; big data frameworks; caching granularity; distributed computing frameworks; distributed systems; in-memory cache; internal memory management primitives; job dependency; job dependency graph; memory management logic; memory retention; pipeline performance; pipeline-aware in-memory caching; pipelined applications; prefetching decisions; Big data; Memory management; Optimization; Pipelines; Prefetching; Sparks; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communication and Networks (ICCCN), 2014 23rd International Conference on
  • Conference_Location
    Shanghai
  • Type

    conf

  • DOI
    10.1109/ICCCN.2014.6911847
  • Filename
    6911847