Title
A framework and a performance assessment for serverless MapReduce on AWS Lambda
Abstract
MapReduce is one of the most widely used programming models for analysing large-scale datasets, i.e. Big Data. In recent years, serverless computing and, in particular, Functions as a Service (FaaS) has surged as an execution model in which no explicit management of servers (e.g. virtual machines) is performed by the user. Instead, the Cloud provider dynamically allocates resources to the function invocations and fine-grained billing is introduced depending on the execution time and allocated memory, as exemplified by AWS Lambda. In this article, a high-performant serverless architecture has been created to execute MapReduce jobs on AWS Lambda using Amazon S3 as the storage backend. In addition, a thorough assessment has been carried out to study the suitability of AWS Lambda as a platform for the execution of High Throughput Computing jobs. The results indicate that AWS Lambda provides a convenient computing platform for general-purpose applications that fit within the constraints of the service (15 min of maximum execution time, 3008 MB of RAM and 512 MB of disk space) but it exhibits an inhomogeneous performance behaviour that may jeopardise adoption for tightly coupled computing jobs.
Year
DOI
Venue
2019
10.1016/j.future.2019.02.057
Future Generation Computer Systems
Keywords
Field
DocType
MapReduce,Serverless,Cloud computing,Elasticity
Architecture,Virtual machine,Programming paradigm,High-throughput computing,Computer science,Server,Execution model,Big data,Cloud computing,Distributed computing
Journal
Volume
ISSN
Citations 
97
0167-739X
6
PageRank 
References 
Authors
0.59
0
3
Name
Order
Citations
PageRank
V. Giménez-Alventosa160.93
Germán Moltó217118.92
Miguel Caballer317216.90