Title
TFX: A TensorFlow-Based Production-Scale Machine Learning Platform
Abstract
Creating and maintaining a platform for reliably producing and deploying machine learning models requires careful orchestration of many components: a learner for generating models based on training data, modules for analyzing and validating both data and models, and finally infrastructure for serving models in production. This becomes particularly challenging when data changes over time and fresh models need to be produced continuously. Unfortunately, such orchestration is often done ad hoc using glue code and custom scripts developed by individual teams for specific use cases, leading to duplicated effort and fragile systems with high technical debt. We present TensorFlow Extended (TFX), a TensorFlow-based general-purpose machine learning platform implemented at Google. By integrating the aforementioned components into one platform, we were able to standardize the components, simplify the platform configuration, and reduce the time to production from the order of months to weeks, while providing platform stability that minimizes disruptions. We present the case study of one deployment of TFX in the Google Play app store, where the machine learning models are refreshed continuously as new data arrive. Deploying TFX led to reduced custom code, faster experiment cycles, and a 2% increase in app installs resulting from improved data and model analysis.
Year
2017
DOI
10.1145/3097983.3098021
Venue
KDD
Keywords
large-scale machine learning, end-to-end platform, continuous training
Field
Training set, Data mining, Use case, Software deployment, App store, Computer science, Glue code, Artificial intelligence, Technical debt, Orchestration (computing), Machine learning, Scripting language
DocType
Conference
ISBN
978-1-4503-4887-4
Citations
29
PageRank
1.14
References
13
Authors
22
Name                  Order  Citations  PageRank
Denis Baylor          1      29         1.14
Eric Breck            2      451        48.62
Heng-Tze Cheng        3      612        26.54
Noah Fiedel           4      29         1.14
Chuan Yu Foo          5      29         1.14
Zakaria Haque         6      362        10.60
Salem Haykal          7      33         1.93
Mustafa Ispir         8      362        10.94
Vihan Jain            9      359        12.86
Levent Koc            10     408        12.83
Chiu Yuen Koo         11     29         1.14
Lukasz Lew            12     29         1.14
Clemens Mewald        13     33         1.61
Akshay Naresh Modi    14     29         1.48
Neoklis Polyzotis     15     2078       138.76
Sukriti Ramesh        16     32         1.53
Sudip Roy             17     309        29.71
Steven Euijong Whang  18     56         2.93
Martin Wicke          19     2140       73.79
Jarek Wilkiewicz      20     29         1.14
Xin Zhang2121889.32
Martin Zinkevich      22     1893       160.99