Title
F1 query: declarative querying at scale
Abstract
AbstractF1 Query is a stand-alone, federated query processing platform that executes SQL queries against data stored in different file-based formats as well as different storage systems at Google (e.g., Bigtable, Spanner, Google Spreadsheets, etc.). F1 Query eliminates the need to maintain the traditional distinction between different types of data processing workloads by simultaneously supporting: (i) OLTP-style point queries that affect only a few records; (ii) low-latency OLAP querying of large amounts of data; and (iii) large ETL pipelines. F1 Query has also significantly reduced the need for developing hard-coded data processing pipelines by enabling declarative queries integrated with custom business logic. F1 Query satisfies key requirements that are highly desirable within Google: (i) it provides a unified view over data that is fragmented and distributed over multiple data sources; (ii) it leverages datacenter resources for performant query processing with high throughput and low latency; (iii) it provides high scalability for large data sizes by increasing computational parallelism; and (iv) it is extensible and uses innovative approaches to integrate complex business logic in declarative query processing. This paper presents the end-to-end design of F1 Query. Evolved out of F1, the distributed database originally built to manage Google's advertising data, F1 Query has been in production for multiple years at Google and serves the querying needs of a large number of users and systems.
Year
DOI
Venue
2018
10.14778/3229863.3229871
Hosted Content
Field
DocType
Volume
SQL,Data processing,Computer science,Business logic,Data type,Distributed database,Online analytical processing,Spanner,Database,Scalability
Journal
11
Issue
ISSN
Citations 
12
2150-8097
0
PageRank 
References 
Authors
0.34
0
36
Name
Order
Citations
PageRank
Bart Samwel1896.72
John Cieslewicz233519.95
Ben Handy300.34
Jason Govig4332.43
Petros Venetis500.34
Chanjun Yang600.68
Keith Peters700.68
Jeff Shute81097.46
Daniel Tenedorio900.68
Himani Apte10533.32
Felix Weigel1100.34
David Wilhite1200.68
Jiacheng Yang1300.34
Jun Xu14161.24
Jiexing Li1521110.36
Zhan Yuan1600.34
Craig Chasseur1700.34
Zeng Qiang183410.73
Ian Rae19863.56
Anurag Biyani2000.34
Andrew Harn2100.34
Yang Xia2200.34
Andrey Gubichev2300.34
Amr El-Helw2400.34
Orri Erling2548932.75
Zhepeng Yan2600.68
Mohan Yang27716.86
Yiqun Wei2800.68
Thanh Do2900.68
Colin Zheng3000.34
Goetz Graefe3100.34
Somayeh Sardashti3200.34
Ahmed M. Aly3300.68
Divy Agrawal3400.68
Ashish Gupta35331.75
Divyakant Agrawal3682011674.75