Title
Taiji: managing global user traffic for large-scale internet services at the edge
Abstract
We present Taiji, a new system for managing user traffic for large-scale Internet services that accomplishes two goals: 1) balancing the utilization of data centers and 2) minimizing network latency of user requests. Taiji models edge-to-datacenter traffic routing as an assignment problem---assigning traffic objects at the edge to the data centers to satisfy service-level objectives. Taiji uses a constraint optimization solver to generate an optimal routing table that specifies the fractions of traffic each edge node will distribute to different data centers. Taiji continuously adjusts the routing table to accommodate the dynamics of user traffic and failure events that reduce capacity. Taiji leverages connections among users to selectively route traffic of highly-connected users to the same data centers based on fractions in the routing table. This routing strategy, which we term connection-aware routing, allows us to reduce query load on our backend storage by 17%. Taiji has been used in production at Facebook for more than four years and routes global traffic in a user-aware manner for several large-scale product services across dozens of edge nodes and data centers.
Year
DOI
Venue
2019
10.1145/3341301.3359655
Proceedings of the 27th ACM Symposium on Operating Systems Principles
Field
DocType
ISBN
Edge node,Computer science,Latency (engineering),Traffic routing,Solver,Routing table,Constrained optimization,The Internet,Distributed computing
Conference
978-1-4503-6873-5
Citations 
PageRank 
References 
1
0.35
0
Authors
12
Name
Order
Citations
PageRank
David Chou1161.69
Tianyin Xu241932.99
Kaushik Veeraraghavan337220.81
Andrew Newell410.69
Sonia Margulis5141.30
Lin Xiao610.35
Pol Mauri Ruiz710.35
Justin Meza841918.08
Kiryong Ha910.35
Shruti Padmanabha1020.69
Kevin Cole1110.35
Dmitri Perelman121207.40