Title
Towards Hybrid Programming in Big Data
Abstract
Within the past decade, there have been a number of parallel programming models developed for data-intensive (i.e., big data) applications. Typically, each model has its own strengths in performance or programmability for some kinds of applications but limitations for others. As a result, multiple programming models are often combined in a complimentary manner to exploit their merits and hide their weaknesses. However, existing models can only be loosely coupled due to their isolated runtime systems. In this paper, we present Transformer, the first system that supports hybrid programming models for data-intensive applications. Transformer has two unique contributions. First, Transformer offers a programming abstraction in a unified runtime system for different programming model implementations, such as Dryad, Spark, Pregel, and PowerGraph. Second, Transformer supports an efficient and transparent data sharing mechanism, which tightly integrates different programming models in a single program. Experimental results on Amazon's EC2 cloud show that Transformer can flexibly and efficiently support hybrid programming models for data-intensive computing.
Year
Venue
Field
2015
USENIX Workshop on Hot Topics in Cloud Computing
Functional reactive programming,System programming,Programming paradigm,Computer science,Inductive programming,Real-time computing,Extensible programming,Reactive programming,Concurrent object-oriented programming,Distributed computing,Runtime system
DocType
Citations 
PageRank 
Conference
3
0.38
References 
Authors
18
4
Name
Order
Citations
PageRank
Peng Wang15010.52
Hong Jiang230424.60
Xu Liu328324.55
Jizhong Han435554.72