Abstract | ||
---|---|---|
This paper studies the effectiveness of pushing parts of DBMS analytics queries into the Simple Storage Service (S3) of Amazon Web Services (AWS), using a recently released capability called S3 Select. We show that some DBMS primitives (filter, projection, and aggregation) can always be cost-effectively moved into S3. Other more complex operations (join, top-K, and group by) require reimplementation to take advantage of S3 Select and are often candidates for pushdown. We demonstrate these capabilities through experimentation using a new DBMS that we developed, PushdownDB. Experimentation with a collection of queries including TPC-H queries shows that PushdownDB is on average 30% cheaper and 6.7x faster than a baseline that does not use S3 Select. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1109/ICDE48307.2020.00174 | 2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020) |
DocType | ISSN | Citations |
Conference | 1084-4627 | 0 |
PageRank | References | Authors |
0.34 | 6 | 7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Xiangyao Yu | 1 | 270 | 16.17 |
Youill Matt | 2 | 0 | 0.34 |
Woicik Matthew | 3 | 0 | 0.34 |
Abdurrahman Ghanem | 4 | 12 | 2.88 |
Marco Serafini | 5 | 229 | 14.33 |
Ashraf Aboulnaga | 6 | 1289 | 91.33 |
Michael Stonebraker | 7 | 12463 | 4310.17 |