Title
Flare & Lantern: Efficiently Swapping Horses Midstream.
Abstract
Running machine learning (ML) workloads at scale is as much a data management problem as a model engineering problem. Big performance challenges exist when data management systems invoke ML classifiers as user-defined functions (UDFs) or when stand-alone ML frameworks interact with data stores for data loading and pre-processing (ETL). In particular, UDFs can be precompiled or simply a black box for the data management system and the data layout may be completely different from the native layout, thus adding overheads at the boundaries. In this demo, we will show how bottlenecks between existing systems can be eliminated when their engines are designed around runtime compilation and native code generation, which is the case for many state-of-the-art relational engines as well as ML frameworks. We demonstrate an integration of Flare (an accelerator for Spark SQL), and Lantern (an accelerator for TensorFlow and PyTorch) that results in a highly optimized end-to-end compiled data path, switching between SQL and ML processing with negligible overhead.
Year
DOI
Venue
2019
10.14778/3352063.3352097
PVLDB
Field
DocType
Volume
Midstream,Swap (computer programming),Computer science,Flare,Lantern,Database
Journal
12
Issue
ISSN
Citations 
12
2150-8097
0
PageRank 
References 
Authors
0.34
0
5
Name
Order
Citations
PageRank
Grégory Essertel1104.24
Ruby Y. Tahboub2143.74
Fei Wang35415.10
James M. Decker452.09
Tiark Rompf574345.86