Abstract | ||
---|---|---|
There are hundreds of online social networks with altogether billions of users. Many such networks publicly release structural information, with all personal information removed. Empirical studies have shown, however, that this provides a false sense of privacy—it is possible to identify almost all users that appear in two such anonymized network as long as a few initial mappings are known. We analyze this problem theoretically by reconciling two versions of an artificial power-law network arising from independent subsampling of vertices and edges. We present a new algorithm that identifies most vertices and makes no wrong identifications with high probability. The number of vertices matched is shown to be asymptotically optimal. For an n-vertex graph, our algorithm uses \(n^\varepsilon \) seed nodes (for an arbitrarily small \(\varepsilon \)) and runs in quasilinear time. This improves previous theoretical results which need \(\Theta (n)\) seed nodes and have runtimes of order \(n^{1+\Omega (1)}\). Additionally, the applicability of our algorithm is studied experimentally on different networks. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1007/s00453-017-0395-0 | Algorithmica |
Keywords | Field | DocType |
Social networks,Locality-sensitive hashing,Network privacy | Locality-sensitive hashing,Discrete mathematics,De-anonymization,Random graph,Social network,Computer science,Theoretical computer science,Personally identifiable information,Empirical research | Conference |
Volume | Issue | ISSN |
80 | 11 | 0178-4617 |
Citations | PageRank | References |
2 | 0.38 | 21 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Karl Bringmann | 1 | 427 | 30.13 |
Tobias Friedrich | 2 | 54 | 8.19 |
Anton Krohmer | 3 | 36 | 5.89 |