Title
GeoMatch: Efficient Large-Scale Map Matching on Apache Spark
Abstract
We contribute by developing GeoMatch as a novel, scalable, and efficient big-data pipeline for large-scale map matching on Apache Spark. GeoMatch improves existing spatial big data solutions by utilizing a novel spatial partitioning scheme inspired by Hilbert space-filling curves. Thanks to the partitioning scheme, GeoMatch can effectively balance operations across different processing units and achieve significant performance gains. We demonstrate the effectiveness of GeoMatch through rigorous and extensive benchmarks that consider data sets containing large-scale urban spatial data sets ranging from 166, 253 to 3.78 billion location measurements. Our results show over 17-fold performance improvements compared to previous works while achieving better processing accuracy than current solutions (97.48%).
Year
DOI
Venue
2018
10.1109/BigData.2018.8622488
2018 IEEE International Conference on Big Data (Big Data)
Keywords
Field
DocType
Big Data,Spatial Data Analysis,Spatial Partitioning,Performance,Query Processing,Spark
Scale (map),Spatial analysis,Space partitioning,Data mining,Data set,Spark (mathematics),Computer science,Big data,Map matching,Scalability
Conference
ISSN
ISBN
Citations 
2639-1589
978-1-5386-5036-3
0
PageRank 
References 
Authors
0.34
0
6
Name
Order
Citations
PageRank
Ayman Zeidan100.34
Eemil Lagerspetz242729.56
Kai Zhao310413.74
Petteri Nurmi462157.08
Sasu Tarkoma51312125.76
Huy T. Vo6103561.10