Title
Analysis of web logs: challenges and findings
Abstract
Web logs are an important source of information for describing and understanding server traffic and its characteristics. Analyzing these logs is challenging because of the large volume of data and the complex relationships hidden in them. Our investigation focuses on the logs of two Web servers and identifies the main characteristics of their workload and the navigation profiles of the crawlers and human users visiting the sites. The classification of these visitors reveals some interesting similarities and differences in terms of traffic intensity and its temporal distribution. In general, crawlers tend to revisit the sites rather often, although, to reduce their impact on server resources, they seldom send bursts of requests. The other clients are also characterized by periodic patterns that can be effectively represented by a few principal components.
Year
2010
DOI
10.1007/978-3-642-25575-5_19
Venue
PERFORM
Keywords
complex relationship, traffic intensity, human user, servers resource, large volume, main characteristic, interesting similarity, important source, web log, web server, navigation profile
Field
World Wide Web, Workload, Computer science, Server, Traffic intensity, Database, Principal component analysis, Web server
DocType
Conference
Citations
6
PageRank
0.46
References
16
Authors
2
Name
Order
Citations
PageRank
Maria Carla Calzarossa17011.31
Luisa Massari210411.19