Title | ||
---|---|---|
Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing |
Abstract | ||
---|---|---|
We present Resilient Distributed Datasets (RDDs), a distributed memory abstraction that lets programmers perform in-memory computations on large clusters in a fault-tolerant manner. RDDs are motivated by two types of applications that current computing frameworks handle inefficiently: iterative algorithms and interactive data mining tools. In both cases, keeping data in memory can improve performance by an order of magnitude. To achieve fault tolerance efficiently, RDDs provide a restricted form of shared memory, based on coarse-grained transformations rather than fine-grained updates to shared state. However, we show that RDDs are expressive enough to capture a wide class of computations, including recent specialized programming models for iterative jobs, such as Pregel, and new applications that these models do not capture. We have implemented RDDs in a system called Spark, which we evaluate through a variety of user applications and benchmarks. |
Year | Venue | Keywords |
---|---|---|
2012 | NSDI | fault-tolerant manner,fault-tolerant abstraction,interactive data mining tool,memory abstraction,shared memory,in-memory cluster computing,iterative job,shared state,fault tolerance,iterative algorithm,current computing framework,coarse-grained transformation |
Field | DocType | Citations |
Abstraction,COLA (software architecture),Spark (mathematics),Shared memory,Programming paradigm,Computer science,Distributed memory,Real-time computing,Fault tolerance,Computer cluster,Distributed computing | Conference | 1255 |
PageRank | References | Authors |
44.75 | 32 | 9 |
Name | Order | Citations | PageRank |
---|---|---|---|
Matei Zaharia | 1 | 9101 | 407.89 |
Mosharaf Chowdhury | 2 | 4807 | 198.24 |
Tathagata Das | 3 | 2580 | 97.96 |
Ankur Dave | 4 | 1917 | 67.99 |
Justin Ma | 5 | 2314 | 104.86 |
Murphy McCauley | 6 | 1255 | 45.08 |
Michael J. Franklin | 7 | 17423 | 1681.10 |
Scott Shenker | 8 | 29892 | 2677.04 |
I. Stoica | 9 | 21406 | 1710.11 |