Title :
Efficient data distribution in a Web server farm
Author :
Burns, Randal C. ; Rees, Robert M. ; Long, Darrell D E
Author_Institution :
IBM Almaden Res. Center, San Jose, CA, USA
Abstract :
High-performance Web sites rely on Web server `farms´, hundreds of computers serving the same content, for scalability, reliability, and low-latency access to Internet content. Deploying these scalable farms typically requires the power of distributed or clustered file systems. Building Web server farms on file systems complements hierarchical proxy caching. Proxy caching replicates Web content throughout the Internet, thereby reducing latency from network delays and off-loading traffic from the primary servers. Web server farms scale resources at a single site, reducing latency from queuing delays. Both technologies are essential when building a high-performance infrastructure for content delivery. The authors present a cache consistency model and locking protocol customized for file systems that are used as scalable infrastructure for Web server farms. The protocol takes advantage of the Web´s relaxed consistency semantics to reduce latencies and network overhead. Our hybrid approach preserves strong consistency for concurrent write sharing with time-based consistency and push caching for readers (Web servers). Using simulation, we compare our approach to the Andrew file system and the sequential consistency file system protocols we propose to replace
Keywords :
Internet; concurrency control; data integrity; file servers; information resources; protocols; Andrew file system; Internet; Internet content; Web content; Web server farm; Web server farms; cache consistency model; clustered file systems; concurrent write sharing; content delivery; data distribution; file systems; hierarchical proxy caching; high-performance Web sites; high-performance infrastructure; hybrid approach; locking protocol; low-latency access; network delays; network overhead; primary servers; push caching; queuing delays; readers; relaxed consistency semantics; scalable farms; scalable infrastructure; sequential consistency file system protocols; strong consistency; time-based consistency; Computer network reliability; Delay; File systems; IP networks; Internet; Power system reliability; Protocols; Scalability; Telecommunication traffic; Web server;
Journal_Title :
Internet Computing, IEEE
DOI :
10.1109/4236.939451