DocumentCode :
3621990
Title :
A Contribution Towards Solving the Web Workload Puzzle
Author :
K. Goseva-Popstojanova; Fengbin Li; Xuan Wang;A. Sangle
Author_Institution :
West Virginia University, Morgantown, WV
fYear :
2006
fDate :
6/28/1905 12:00:00 AM
Firstpage :
505
Lastpage :
516
Abstract :
World Wide Web, the biggest distributed system ever built, experiences tremendous growth and change in Web sites, users, and technology. A realistic and accurate characterization of Web workload is the first, fundamental step in areas such as performance analysis and prediction, capacity planning, and admission control. Compared to the previous work, in this paper we present more detailed and rigorous statistical analysis of both request and session level characteristics of Web workload based on empirical data extracted from actual logs of four Web servers. Our analysis is focused on exploring phenomena such as self-similarity, long-range dependence, and heavy-tailed distributions. Identification of these phenomena in real data is a challenging task since the existing methods may perform erratically in practice and produce misleading results. We provide more accurate analysis of long-range dependence of the request and session arrival processes by removing the trend and periodicity. In addition to the session arrival process (i.e., inter-session characteristics), we study several intra-session characteristics using several different methods to test the existence of heavy-tailed behavior and cross validate the results. Finally, we point out specific problems associated with the methods used for establishing long-range dependence and heavy-tailed behavior of Web workloads. We believe that the comprehensive model presented in this paper is a step towards solving the Web workload puzzle
Keywords :
"Telecommunication traffic","Traffic control","Web sites","Performance analysis","Admission control","Web server","Computer science","Capacity planning","Statistical analysis","Data mining"
Publisher :
ieee
Conference_Titel :
Dependable Systems and Networks, 2006. DSN 2006. International Conference on
Print_ISBN :
0-7695-2607-1
Type :
conf
DOI :
10.1109/DSN.2006.2
Filename :
1633539
Link To Document :
بازگشت