Title
Network Sampling Based on Centrality Measures for Relational Classification.
Abstract
Many real-world networks, such as the Internet, social networks, biological networks, and others, are massive in size, which impairs their processing and analysis. To cope with this, the network size could be reduced without losing relevant information. In this paper, we extend a work that proposed a sampling method based on the following centrality measures: degree, k-core, clustering, eccentricity and structural holes. For our experiments, we remove 30% and 50% of the vertices and their edges from the original network. After, we evaluate our proposal on six real-world networks on relational classification task using eight different classifiers. Classification results achieved on sampled graphs generated from our proposal are similar to those obtained on the entire graphs. The execution time for learning step of the classifier is shorter on the sampled graph compared to the entire graph and random sampling. In most cases, the original graph was reduced by up to 50% of its initial number of edges without losing topological properties.
Year
DOI
Venue
2016
10.1007/978-3-319-55209-5_4
Communications in Computer and Information Science
Keywords
Field
DocType
Network sampling,Relational classification,Centrality measures,Missing data,Complex networks
Vertex (geometry),Pattern recognition,Biological network,Computer science,Centrality,Sampling (statistics),Artificial intelligence,Complex network,Missing data,Cluster analysis,Classifier (linguistics)
Conference
Volume
ISSN
Citations 
656
1865-0929
0
PageRank 
References 
Authors
0.34
0
5