Title
Efficient Similarity-Based Alignment Of Temporally-Situated Graph Nodes With Apache Spark
Abstract
Topic evolution networks are widely used to represent the evolution of research topics in scientific document archives. These networks might contain thousands of topics and alignment edges which are computed by comparing millions of topic pairs with some similarity function. In this work, we are addressing the problem of computing a very large number cosine-based topic alignments on top of Apache Spark. We present the native map-reduce implementation proposed by Spark and a more efficient implementation which is tuned for alignment computation. Both implementations are evaluated on three real-world datasets.
Year
DOI
Venue
2019
10.1109/BigData47090.2019.9005483
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA)
Field
DocType
ISSN
Situated,Data mining,Graph,Trigonometric functions,Spark (mathematics),Computer science,Implementation,Large numbers,Computation
Conference
2639-1589
Citations 
PageRank 
References 
0
0.34
0
Authors
4
Name
Order
Citations
PageRank
Hubert Naacke112825.41
Ke Li200.34
Bernd Amann342559.99
Olivier Curé410227.53