Title
The RoadRunner framework for efficient and scalable processing of big data
Abstract
In this paper, we present the overall architecture of RoadRunner, a Hadoop-based framework that enhances the efficiency of rank-aware query processing by introducing various optimizations to Hadoop, without changing its internal operation. RoadRunner focuses on a specific class of queries that involve ranking, such as top-k queries and top-k joins, as well as on preference-aware queries, such as skyline queries, which are tightly related. For this class of queries, we identify improvements on various stages of MapReduce processing, which result in improved performance without sacrificing scalability. We describe the RoadRunner framework, along with individual modules and their roles, and we demonstrate the merits of the proposed framework by means of showcase query examples.
Year
DOI
Venue
2015
10.1145/2801948.2801963
Panhellenic Conference on Informatics
Field
DocType
Citations 
Skyline,Data mining,Joins,Architecture,Ranking,Computer science,Roadrunner,Big data,Database,Scalability
Conference
0
PageRank 
References 
Authors
0.34
23
5
Name
Order
Citations
PageRank
Christos Doulkeridis189955.91
Akrivi Vlachou275139.95
Panagiotis Nikitopoulos343.49
Panagiotis Tampakis4195.18
Mei Saouk510.69