HPAT: High Performance Analytics with Scripting Ease-of-Use. - Citegraph

Paper Info

Title
HPAT: High Performance Analytics with Scripting Ease-of-Use.

Abstract
Big data analytics requires high programmer productivity and high performance simultaneously on large-scale clusters. However, current big data analytics frameworks (e.g. Apache Spark) have prohibitive runtime overheads since they are library-based. We introduce a novel auto-parallelizing compiler approach that exploits the characteristics of the data analytics domain such as the map/reduce parallel pattern and is robust, unlike previous auto-parallelization methods. Using this approach, we build High Performance Analytics Toolkit (HPAT), which parallelizes high-level scripting (Julia) programs automatically, generates efficient MPI/C++ code, and provides resiliency. Furthermore, it provides automatic optimizations for scripting programs, such as fusion of array operations. Thus, HPAT is 369X to 2033X faster than Spark on the Cori supercomputer and 20X to 256X times on Amazon AWS.

Year	DOI	Venue
2017	10.1145/3079079.3079099	Proceedings of the International Conference on Supercomputing
DocType	ISBN	Citations
Conference	978-1-4503-5020-4	3
PageRank	References	Authors
0.37	30	3

Authors (3 rows)

Cited by (3 rows)

References (30 rows)

Name	Order	Citations	PageRank
Ehsan Totoni	1	74	7.77
Todd A Anderson	2	151	21.51
Tatiana Shpeisman	3	436	32.69

1