• DocumentCode
    3452946
  • Title

    Discretionary caching for I/O on clusters

  • Author

    Vilayannur, Murali ; Sivasubramaniam, Anand ; Kandemir, Mahmut ; Thakur, Rajeev ; Ross, Robert

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    2003
  • fDate
    12-15 May 2003
  • Firstpage
    96
  • Lastpage
    103
  • Abstract
    I/O bottlenecks are already a problem in many largescale applications that manipulate huge datasets. This problem is expected to get worse as applications get larger, and the I/O subsystem performance lags behind processor and memory speed improvements. Caching I/O blocks is one effective way of alleviating disk latencies, and there can be multiple levels of caching on a cluster of workstations. Previous studies have shown the benefits of caching whether it be local to a particular node, or a shared global cache across the cluster - for certain applications. However, we show that while caching is useful in some situations, it can hurt performance if we are not careful about what to cache and when to bypass the cache. This paper presents compilation techniques and runtime support to address this problem. These techniques are implemented and evaluated on an experimental Linux/Pentium cluster running a parallel file system. Our results using a diverse set of applications (scientific and commercial) demonstrate the benefits of a discretionary approach to caching for I/O subsystems on clusters, providing as much as 33% savings over indiscriminately caching everything in some applications.
  • Keywords
    cache storage; optimising compilers; parallel architectures; parallel machines; workstation clusters; Linux/Pentium cluster; caching I/O blocks; compilation techniques; disk latency; parallel file system; runtime support; workstation clusters; Application software; Computer science; Costs; Data engineering; Delay; File systems; Hardware; Laboratories; Mathematics; Runtime;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster Computing and the Grid, 2003. Proceedings. CCGrid 2003. 3rd IEEE/ACM International Symposium on
  • Print_ISBN
    0-7695-1919-9
  • Type

    conf

  • DOI
    10.1109/CCGRID.2003.1199357
  • Filename
    1199357