Title
Lambda architecture for cost-effective batch and speed big data processing
Abstract
Sensor and smart phone technologies present opportunities for data explosion, streaming and collecting from heterogeneous devices every second. Analyzing these large datasets can unlock multiple behaviors previously unknown, and help optimize approaches to city wide applications or societal use cases. However, collecting and handling of these massive datasets presents challenges in how to perform optimized online data analysis `on-the-fly¿, as current approaches are often limited by capability, expense and resources. This presents a need for developing new methods for data management particularly using public clouds to minimize cost, network resources and on-demand availability. This paper presents an implementation of the lambda architecture design pattern to construct a data-handling backend on Amazon EC2, providing high throughput, dense and intense data demand delivered as services, minimizing the cost of the network maintenance. This paper combines ideas from database management, cost models, query management and cloud computing to present a general architecture that could be applied in any given scenario where affordable online data processing of Big Datasets is needed. The results are presented with a case study of processing router sensor data on the current ESnet network data as a working example of the approach. The results showcase a reduction in cost and argue benefits for performing online analysis and anomaly detection for sensor data.
Year
DOI
Venue
2015
10.1109/BigData.2015.7364082
Big Data
Keywords
Field
DocType
big data processing, lambda architecture, Amazon EC2, sensor data analysis
Data warehouse,Data mining,Anomaly detection,Data architecture,COLA (software architecture),Computer science,Data virtualization,Data management,Big data,Cloud computing
Conference
Citations 
PageRank 
References 
15
1.11
21
Authors
5
Name
Order
Citations
PageRank
Mariam Kiran112117.83
Peter Murphy2151.11
Inder Monga322019.18
Jon Dugan4151.11
Sartaj Singh Baveja5151.11