A framework and a performance assessment for serverless MapReduce on AWS Lambda - Citegraph

Paper Info

Title
A framework and a performance assessment for serverless MapReduce on AWS Lambda

Abstract
MapReduce is one of the most widely used programming models for analysing large-scale datasets, i.e. Big Data. In recent years, serverless computing and, in particular, Functions as a Service (FaaS) has surged as an execution model in which no explicit management of servers (e.g. virtual machines) is performed by the user. Instead, the Cloud provider dynamically allocates resources to the function invocations and fine-grained billing is introduced depending on the execution time and allocated memory, as exemplified by AWS Lambda. In this article, a high-performant serverless architecture has been created to execute MapReduce jobs on AWS Lambda using Amazon S3 as the storage backend. In addition, a thorough assessment has been carried out to study the suitability of AWS Lambda as a platform for the execution of High Throughput Computing jobs. The results indicate that AWS Lambda provides a convenient computing platform for general-purpose applications that fit within the constraints of the service (15 min of maximum execution time, 3008 MB of RAM and 512 MB of disk space) but it exhibits an inhomogeneous performance behaviour that may jeopardise adoption for tightly coupled computing jobs.

Year	DOI	Venue
2019	10.1016/j.future.2019.02.057	Future Generation Computer Systems
Keywords	Field	DocType
MapReduce,Serverless,Cloud computing,Elasticity	Architecture,Virtual machine,Programming paradigm,High-throughput computing,Computer science,Server,Execution model,Big data,Cloud computing,Distributed computing	Journal
Volume	ISSN	Citations
97	0167-739X	6
PageRank	References	Authors
0.59	0	3

Authors (3 rows)

Cited by (6 rows)

References (0 rows)

Name	Order	Citations	PageRank
V. Giménez-Alventosa	1	6	0.93
Germán Moltó	2	171	18.92
Miguel Caballer	3	172	16.90

1