Title | ||
---|---|---|
Efficient Similarity-Based Alignment Of Temporally-Situated Graph Nodes With Apache Spark |
Abstract | ||
---|---|---|
Topic evolution networks are widely used to represent the evolution of research topics in scientific document archives. These networks might contain thousands of topics and alignment edges which are computed by comparing millions of topic pairs with some similarity function. In this work, we are addressing the problem of computing a very large number cosine-based topic alignments on top of Apache Spark. We present the native map-reduce implementation proposed by Spark and a more efficient implementation which is tuned for alignment computation. Both implementations are evaluated on three real-world datasets. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1109/BigData47090.2019.9005483 | 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) |
Field | DocType | ISSN |
Situated,Data mining,Graph,Trigonometric functions,Spark (mathematics),Computer science,Implementation,Large numbers,Computation | Conference | 2639-1589 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Hubert Naacke | 1 | 128 | 25.41 |
Ke Li | 2 | 0 | 0.34 |
Bernd Amann | 3 | 425 | 59.99 |
Olivier Curé | 4 | 102 | 27.53 |