Title
Mining the Most Interesting Web Access Associations
Abstract
Web access patterns can provide valuable information for website designers in making website-based communication more efficient. To extract interesting or useful web access patterns, we use data mining techniques which analyze historical web access logs. In this paper, we present an efficient approach to mine the most interesting web access associations, where the word "interesting" denotes patterns that are supported by a high fraction of access activities with strong confidence. Our approach consists of three steps: 1) transform raw web logs to a relational table; 2) convert the relational table to a collection of access transactions; 3) mine the transaction collection to extract associations and rules. In both step 1 and step 2, we provide users with an effective mechanism to help them generate only "interesting" access records and transactions for mining. In the third step, we present a new efficient data mining algorithm to find the most interesting web access associations. We evaluate this approach using both synthetic data sets and real web logs and show the efficacy, efficiency and good scalability of the proposed mining methods.
Year
Venue
Keywords
2000
WebNet
web accessibility,data mining,synthetic data
Field
DocType
Citations 
Data science,World Wide Web,Data stream mining,Text mining,Web mining,Web intelligence,Web access,Computer science,Web standards
Conference
2
PageRank 
References 
Authors
0.38
5
6
Name
Order
Citations
PageRank
Li Shen1863102.99
Ling Cheng220.38
James Ford322716.26
Fillia Makedon41676201.73
Vasileios Megalooikonomou556864.84
Tilmann Steinberg6143.10