Title
One SQL to Rule Them All - an Efficient and Syntactically Idiomatic Approach to Management of Streams and Tables
Abstract
Real-time data analysis and management are increasingly critical for today's businesses. SQL is the de facto lingua franca for these endeavors, yet support for robust streaming analysis and management with SQL remains limited. Many approaches restrict semantics to a reduced subset of features and/or require a suite of non-standard constructs. Additionally, use of event timestamps to provide native support for analyzing events according to when they actually occurred is not pervasive, and often comes with important limitations. We present a three-part proposal for integrating robust streaming into SQL, namely: (1) time-varying relations as a foundation for classical tables as well as streaming data, (2) event time semantics, (3) a limited set of optional keyword extensions to control the materialization of time-varying query results. We show how with these minimal additions it is possible to utilize the complete suite of standard SQL semantics to perform robust stream processing. We motivate and illustrate these concepts using examples and describe lessons learned from implementations in Apache Calcite, Apache Flink, and Apache Beam. We conclude with syntax and semantics of a concrete proposal for extensions of the SQL standard and note further areas of exploration.
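As an informal illustration of the first and third parts of the proposal, the sketch below models a time-varying relation as an ordered history of changes and renders it in two ways: as a table snapshot at a point in time, and as a stream of changes. It is a minimal sketch under stated assumptions, not the paper's implementation or proposed syntax; the names `Change`, `snapshot`, and `changelog` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Change:
    """One update to the relation (hypothetical structure for illustration)."""
    time: int             # time at which the change took effect
    key: str              # row key
    value: Optional[int]  # new value, or None to delete the row


def snapshot(changes: List[Change], at: int) -> Dict[str, int]:
    """Table view: the relation's contents as of time `at`."""
    rows: Dict[str, int] = {}
    for c in sorted(changes, key=lambda c: c.time):
        if c.time > at:
            break
        if c.value is None:
            rows.pop(c.key, None)   # a deletion removes the row
        else:
            rows[c.key] = c.value   # an insert or update overwrites the row
    return rows


def changelog(changes: List[Change], since: int) -> List[Change]:
    """Stream view: the changes applied after time `since`, in time order."""
    return [c for c in sorted(changes, key=lambda c: c.time) if c.time > since]


# Assumed example history: two inserts, one update, one delete.
history = [
    Change(1, "a", 10),
    Change(2, "b", 20),
    Change(3, "a", 15),
    Change(4, "b", None),
]

print(snapshot(history, at=2))
print(changelog(history, since=2))
```

Both functions read the same underlying history, which is the point of treating tables and streams as two materializations of one time-varying relation.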
Year: 2019
DOI: 10.1145/3299869.3314040
Venue: Proceedings of the 2019 International Conference on Management of Data
Keywords: data management, query processing, stream processing
DocType: Conference
ISSN: 0730-8078
ISBN: 978-1-4503-5643-5
Citations: 1
PageRank: 0.39
References: 0
Authors: 6
Name                 Order  Citations  PageRank
Edmon Begoli         1      2          0.77
Tyler Akidau         2      1          0.72
Fabian Hueske        3      489        20.81
Julian Hyde          4      5          1.48
Kathryn Knight       5      2          2.12
Kenneth W. Knowles   6      2          1.45