Abstract | ||
---|---|---|
Many real-world networks, such as the Internet, social networks, biological networks, and others, are massive in size, which impairs their processing and analysis. To cope with this, the network size could be reduced without losing relevant information. In this paper, we extend a work that proposed a sampling method based on the following centrality measures: degree, k-core, clustering, eccentricity and structural holes. For our experiments, we remove 30% and 50% of the vertices and their edges from the original network. After, we evaluate our proposal on six real-world networks on relational classification task using eight different classifiers. Classification results achieved on sampled graphs generated from our proposal are similar to those obtained on the entire graphs. The execution time for learning step of the classifier is shorter on the sampled graph compared to the entire graph and random sampling. In most cases, the original graph was reduced by up to 50% of its initial number of edges without losing topological properties. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1007/978-3-319-55209-5_4 | Communications in Computer and Information Science |
Keywords | Field | DocType |
Network sampling,Relational classification,Centrality measures,Missing data,Complex networks | Vertex (geometry),Pattern recognition,Biological network,Computer science,Centrality,Sampling (statistics),Artificial intelligence,Complex network,Missing data,Cluster analysis,Classifier (linguistics) | Conference |
Volume | ISSN | Citations |
656 | 1865-0929 | 0 |
PageRank | References | Authors |
0.34 | 0 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Lilian Berton | 1 | 16 | 7.82 |
Didier A. Vega-Oliveros | 2 | 24 | 5.37 |
Jorge Carlos Valverde-Rebaza | 3 | 79 | 8.11 |
André Tavares da Silva | 4 | 46 | 2.93 |
Alneu de Andrade Lopes | 5 | 275 | 29.00 |