Title
Automatically Spotting Information-Rich Nodes in Graphs
Abstract
How to automatically spot the most interesting nodes in a large graph? Although graph mining has attracted lots of attention this question remains challenging for the following reasons: (1) It is not trivial to define what an interesting node is. In some cases, application-specific background knowledge may help but is not always available. Moreover, the notion of interestingness depends on the structure of the graph. In a sparse graph, a hub or a node at the center of a star-like or clique-like sub graph might be interesting. However, if the graph is densely connected, a remote node might be more interesting. (2) Most existing approaches to detect abnormal nodes involve parameters which are difficult to estimate. Our approach addresses both challenges by considering the problem from an information-theoretic perspective. An interesting node exhibits an outstanding link pattern which contributes much nonredundant information to the overall information content of the graph. Based on this basic idea, we define an intuitive similarity measure between nodes. Combined with metric embedding, this similarity measure allows spotting the most interesting nodes at first glance. Moreover, we propose Info-spot, a parameter free and efficient algorithm for fully automatic detection of interesting nodes. Guided by the objective of data compression, Info-spot greedily merges nodes exhibiting similar link patterns, while exceptional information-rich nodes naturally emerge. Extensive experiments demonstrate the benefits of our approach.
Year
DOI
Venue
2011
10.1109/ICDMW.2011.37
ICDM Workshops
Keywords
DocType
ISBN
information-theoretic,automatic information-rich node spotting,rich link pattern,remote node,outstanding nodes,existing approach,nonredundant information,clique-like sub graph,automatically spotting information-rich nodes,star-like subgraph,sparse graph,automatically detection,large graph,abnormal node,graph mining,data mining,graph theory,clique-like subgraph,information-theoretic perspective,automatic interesting node detection,interesting node,info-spot algorithm,abnormal node detection,intuitive similarity measure,information content,data compression
Conference
978-1-4673-0005-6
Citations 
PageRank 
References 
0
0.34
10
Authors
3
Name
Order
Citations
PageRank
Xiao He1644.65
Jing Feng2225.35
Claudia Plant353654.69