Abstract | ||
---|---|---|
Proliferation of RDF data has reached to a peak where data is partitioned across multiple nodes. Significant contribution for developing solutions to manage RDF data in distributed environment is witnessed in recent years. We propose a workload aware hybrid partitioning approach for a distributed environment. The objective of our approach is reducing query joins and inter-node communication leading it to faster query execution for frequent queries. Our approach considers a query workload and partitions data based on workload information. It distributes data by exploiting underlying structural relationship between properties using a property reachability matrix to optimize query performance. DWAHP gets rid of inter-node communication cost for frequent queries like linear and star queries and answers 83% of frequent query workload without inter-node communication. DWAHP is compared with state-of-the-art solutions in terms of query execution time, query cost, storage space, and inter-node communication. It has demonstrated significant improvement over state-of-the-art solution. |
Year | DOI | Venue |
---|---|---|
2017 | 10.1145/3105831.3105864 | IDEAS |
Field | DocType | ISBN |
Query optimization,Web search query,Data mining,RDF query language,Query expansion,Computer science,Sargable,Web query classification,Theoretical computer science,Spatial query,Database,Boolean conjunctive query | Conference | 978-1-4503-5220-8 |
Citations | PageRank | References |
3 | 0.55 | 12 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Trupti Padiya | 1 | 9 | 2.19 |
Minal Bhise | 2 | 11 | 3.24 |