Title
Web Structure Mining by Isolated Cliques
Abstract
The link structure of the Web is generally viewed as the webgraph. Web structure mining is a research area that mainly aims to find hidden communities by focusing on the webgraph, and communities or their cores are supposed to constitute dense subgraphs. Therefore, structure mining can actually be realized by enumerating such substructures, and Kleinberg's biclique model is well-known among them. In this paper, we examine some candidate substructures, including conventional bicliques, and attempt to find useful information from the real web data. Especially, we newly exploit isolated cliques for our experiments of structure mining. As a result, we discovered that isolated cliques that lie over multiple domains can stand for useful communities, which implies the validity of isolated clique as a candidate substructure for structure mining. On the other hand, we also observed that most of isolated cliques on the Web correspond to menu structures and are inherent in single domains, and that isolated cliques can be quite useful for detecting harmful link farms.
Year
DOI
Venue
2007
10.1093/ietisy/e90-d.12.1998
IEICE Transactions
Keywords
Field
DocType
structure mining,real web data,web structure mining,isolated cliques,link structure,menu structure,isolated clique,useful community,harmful link farm,useful information,candidate substructure
Data mining,Complete bipartite graph,Structure mining,Webgraph,Clique,Link analysis,Computer science,Exploit,Web community,Link farm
Journal
Volume
Issue
ISSN
E90-D
12
1745-1361
Citations 
PageRank 
References 
1
0.35
11
Authors
3
Name
Order
Citations
PageRank
yushi uno122228.80
Yoshinobu Ota231.09
Akio Uemichi330.75