Title
Approximate graph mining with label costs
Abstract
Many real-world graphs have complex labels on the nodes and edges. Mining only exact patterns yields limited insights, since it may be hard to find exact matches. However, in many domains it is relatively easy to define a cost (or distance) between different labels. Using this information, it becomes possible to mine a much richer set of approximate subgraph patterns, which preserve the topology but allow bounded label mismatches. We present novel and scalable methods to efficiently solve the approximate isomorphism problem. We show that approximate mining yields interesting patterns in several real-world graphs ranging from IT and protein interaction networks to protein structures.
Year
DOI
Venue
2013
10.1145/2487575.2487602
KDD
Keywords
Field
DocType
complex label,bounded label mismatches,exact match,protein interaction network,protein structure,label cost,approximate mining yield,real-world graph,approximate graph mining,approximate isomorphism problem,approximate subgraph pattern,exact patterns yield
Data mining,Graph isomorphism,Computer science,Molecule mining,Induced subgraph isomorphism problem,Power graph analysis,Isomorphism,Artificial intelligence,Subgraph isomorphism problem,Machine learning,Bounded function,Scalability
Conference
Citations 
PageRank 
References 
13
0.50
17
Authors
5
Name
Order
Citations
PageRank
Pranay Anchuri1502.27
Mohammed Javeed Zaki27972536.24
Omer Barkol31027.78
Shahar Golan4575.72
Moshe Shamy5130.50