Title | ||
---|---|---|
A Comparison Of Big Data Application Programming Approaches: A Travel Companion Case Study |
Abstract | ||
---|---|---|
With advances of big data technologies, there are many possible ways to program for each big data application. A challenge is to know the differences of the program approaches and decide which programming approach is the best for a particular big data application. In this paper, we use vehicle travel companion as a case study to explore four different programming approaches, including Spark RDD (with GroupBy or Join), Spark SQL with Hive and Hive on Hadoop, and tune the programmed big data applications. Our experiments show that the execution time of one programming approach could be more than 100-fold longer than that of another for the same application logic, which verifies that programming approach decision is important. We also explain the reasons for the differences. The findings could be applied to the selection of programming approach for other big data applications. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1109/BigData.2017.8258255 | 2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) |
Keywords | DocType | ISSN |
Big Data, Spark, Hive, SQL, Programming Comparison, Benchmarking, Travel Companion | Conference | 2639-1589 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Pei Guo | 1 | 0 | 0.34 |
Jianwu Wang | 2 | 215 | 26.72 |
Zhiyuan Chen | 3 | 40 | 2.21 |