• DocumentCode
    1341097
  • Title

    Traffic analysis of a Web proxy caching hierarchy

  • Author

    Mahanti, Anirban ; Williamson, Carey ; Eager, Derek

  • Author_Institution
    Saskatchewan Univ., Saskatoon, Sask., Canada
  • Volume
    14
  • Issue
    3
  • fYear
    2000
  • Firstpage
    16
  • Lastpage
    23
  • Abstract
    Understanding Web traffic characteristics is key to improving the performance and scalability of the Web. In this article Web proxy workloads from different levels of a caching hierarchy are used to understand how the workload characteristics change across different levels of a caching hierarchy. The main observations of this study are that HTML and image documents account for 95 percent of the documents seen in the workload; the distribution of transfer sizes of documents is heavy-tailed, with the tails becoming heavier as one moves up the caching hierarchy; the popularity profile of documents does not precisely follow the Zipf distribution; one-timers account for approximately 70 percent of the documents referenced; concentration of references is less at proxy caches than at servers, and concentration of references diminishes as one moves up the caching hierarchy; and the modification rate is higher at higher-level proxies
  • Keywords
    Internet; cache storage; hypermedia markup languages; information resources; search engines; telecommunication traffic; HTML documents; Web proxy caching hierarchy; Web proxy servers; Web traffic characteristics; application-level software; image documents; modification rate; performance improvement; references; traffic analysis; transfer size distribution; workload characteristics; Bandwidth; Cache memory; Delay; HTML; IP networks; Internet; Network servers; Scalability; Telecommunication traffic; Web server;
  • fLanguage
    English
  • Journal_Title
    Network, IEEE
  • Publisher
    ieee
  • ISSN
    0890-8044
  • Type

    jour

  • DOI
    10.1109/65.844496
  • Filename
    844496