Abstract | ||
---|---|---|
MapReduce is one of the most widely used programming models for analysing large-scale datasets, i.e. Big Data. In recent years, serverless computing and, in particular, Functions as a Service (FaaS) has surged as an execution model in which no explicit management of servers (e.g. virtual machines) is performed by the user. Instead, the Cloud provider dynamically allocates resources to the function invocations and fine-grained billing is introduced depending on the execution time and allocated memory, as exemplified by AWS Lambda. In this article, a high-performant serverless architecture has been created to execute MapReduce jobs on AWS Lambda using Amazon S3 as the storage backend. In addition, a thorough assessment has been carried out to study the suitability of AWS Lambda as a platform for the execution of High Throughput Computing jobs. The results indicate that AWS Lambda provides a convenient computing platform for general-purpose applications that fit within the constraints of the service (15 min of maximum execution time, 3008 MB of RAM and 512 MB of disk space) but it exhibits an inhomogeneous performance behaviour that may jeopardise adoption for tightly coupled computing jobs. |
Year | DOI | Venue |
---|---|---|
2019 | 10.1016/j.future.2019.02.057 | Future Generation Computer Systems |
Keywords | Field | DocType |
MapReduce,Serverless,Cloud computing,Elasticity | Architecture,Virtual machine,Programming paradigm,High-throughput computing,Computer science,Server,Execution model,Big data,Cloud computing,Distributed computing | Journal |
Volume | ISSN | Citations |
97 | 0167-739X | 6 |
PageRank | References | Authors |
0.59 | 0 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
V. Giménez-Alventosa | 1 | 6 | 0.93 |
Germán Moltó | 2 | 171 | 18.92 |
Miguel Caballer | 3 | 172 | 16.90 |