Title | ||
---|---|---|
SigMR: MapReduce-based SPARQL query processing by signature encoding and multi-way join |
Abstract | ||
---|---|---|
Large numbers of Resource Description Framework triples are available in Linked Data which can grow exponentially. It makes SPARQL query processing engines infeasible on a single machine. To address this scalability issue, MapReduce framework-based SPARQL engines have been proposed, but we note that these methods are limited in terms of join evaluations. The two-way join-based approach evaluates joins via a sequence of binary multiplications that require multiple MapReduce jobs, which involves costly disk accesses between MapReduce jobs. The multi-way join-based approach combines multiple two-way join operations, which allows the simultaneous evaluation of joins during one MapReduce job. However, the size of data for the MapReduce job might increase exponentially if a complex query is given. In this study, we propose SigMR, a pruning method for multi-way join-based SPARQL query processing in MapReduce. In the proposed approach, a SPARQL query can be evaluated in a single MapReduce job, where the size of data is reduced dramatically by pruning based on our signature encoding technique, thereby overcoming the weaknesses of the previous approaches. In experiments, we showed that the query processing time required was lower with our approach than existing MapReduce-based methods. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1007/s11227-015-1459-z | The Journal of Supercomputing |
Keywords | Field | DocType |
Hadoop,MapReduce,Multi-way join,Signature encoding,SigMR,SPARQL | Joins,Computer science,Parallel computing,Linked data,Sort-merge join,SPARQL,RDF,Encoding (memory),Distributed computing,Scalability,Binary number | Journal |
Volume | Issue | ISSN |
71 | 10 | 0920-8542 |
Citations | PageRank | References |
3 | 0.40 | 32 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jinhyun Ahn | 1 | 25 | 5.65 |
Dong-Hyuk Im | 2 | 35 | 6.06 |
Hong-Gee Kim | 3 | 225 | 22.83 |