Title
Contracted webgraphs: structure mining and scale-freeness
Abstract
The link structure of the Web is generally viewed as the webgraph. One of the main objectives of web structure mining is to find hidden communities on the Web based on the webgraph, and one of its approaches tries to enumerate substructures each of which corresponds to a set of web pages of a community or its core. Through those research, it has been turned out that certain substructures can find sets of pages that are inherently irrelevant to communities. In this paper, we propose a model, which we call contracted webgraphs, where such substructures are contracted into single nodes to hide useless information. We then try structure mining iteratively on those contracted webgraphs since we can expect to find further hidden information once irrelevant information is eliminated. We also explore structural properties of contracted webgraphs from the viewpoint of scale-freeness, and we observe that they exhibit novel and extreme self-similarities.
Year
DOI
Venue
2011
10.1007/978-3-642-21204-8_31
FAW-AAIM
Keywords
Field
DocType
hidden community,structure mining iteratively,web page,web structure mining,extreme self-similarities,certain substructure,link structure,irrelevant information,useless information,hidden information
Structure mining,World Wide Web,Webgraph,Web page,Degree distribution,Engineering,Web structure
Conference
Volume
ISSN
Citations 
6681
0302-9743
0
PageRank 
References 
Authors
0.34
18
2
Name
Order
Citations
PageRank
yushi uno122228.80
Fumiya Oguri200.68