• DocumentCode
    3040924
  • Title

    Class-based cache management for dynamic Web content

  • Author

    Zhu, Huican ; Yang, Tao

  • Author_Institution
    Dept. of Comput. Sci., California Univ., Santa Barbara, CA, USA
  • Volume
    3
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    1215
  • Abstract
    Caching dynamic pages at a server site is beneficial in reducing server resource demands and it also helps dynamic page caching at proxy sites. Previous work has used fine-grain dependence graphs among individual dynamic pages and underlying data sets to enforce result consistency. This paper proposes a complementary solution for applications that require coarse-grain cache management. The key idea is to partition dynamic pages into classes based on URL patterns so that an application can specify page identification and data dependence, and invoke invalidation for a class of dynamic pages. To make this scheme time-efficient with small space requirement, lazy invalidation is used to minimize slow disk accesses when IDs of dynamic pages are stored in memory with a digest format. Selective precomputing is further proposed to refresh stale pages and smoothen load peaks. A data structure is developed for efficient URL class searching during lazy or eager invalidation. This paper also presents design and implementation of a caching system called Cachuma which integrates the above techniques, runs in tandem with standard Web servers, and allows Web sites to add dynamic page caching capability with minimal changes. The experimental results show that the proposed techniques are effective in supporting coarse-grain cache management and reducing server response times for tested applications
  • Keywords
    Internet; cache storage; client-server systems; data structures; file servers; Cachuma; URL patterns; class-based cache management; coarse-grain cache management; data dependence; data structure; digest format; dynamic Web content; dynamic page caching; dynamic pages; eager invalidation; lazy invalidation; load peaks; page identification; precomputing; proxy sites; server resource demands; server response times; server site; slow disk accesses; stale pages; Computer science; Content management; Data structures; Intrusion detection; Network servers; Pattern matching; Prefetching; Uniform resource locators; Web page design; Web server;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    INFOCOM 2001. Twentieth Annual Joint Conference of the IEEE Computer and Communications Societies. Proceedings. IEEE
  • Conference_Location
    Anchorage, AK
  • ISSN
    0743-166X
  • Print_ISBN
    0-7803-7016-3
  • Type

    conf

  • DOI
    10.1109/INFCOM.2001.916615
  • Filename
    916615