Title
Efficient and Effective Express via Contextual Cooperative Reinforcement Learning
Abstract
Express systems are widely deployed in many major cities. Couriers in an express system load parcels at transit station and deliver them to customers. Meanwhile, they also try to serve the pick-up requests which come stochastically in real time during the delivery process. Having brought much convenience and promoted the development of e-commerce, express systems face challenges on courier management to complete the massive number of tasks per day. Considering this problem, we propose a reinforcement learning based framework to learn a courier management policy. Firstly, we divide the city into independent regions, in each of which a constant number of couriers deliver parcels and serve requests cooperatively. Secondly, we propose a soft-label clustering algorithm named Balanced Delivery-Service Burden (BDSB) to dispatch parcels to couriers in each region. BDSB guarantees that each courier has almost even delivery and expected request-service burden when departing from transit station, giving a reasonable initialization for online management later. As pick-up requests come in real time, a Contextual Cooperative Reinforcement Learning (CCRL) model is proposed to guide where should each courier deliver and serve in each short period. Being formulated in a multi-agent way, CCRL focuses on the cooperation among couriers while also considering the system context. Experiments on real-world data from Beijing are conducted to confirm the outperformance of our model.
Year
DOI
Keywords
2019
10.1145/3292500.3330968
constrained clustering, express system, reinforcement learning
Field
DocType
ISSN
Computer science,Constrained clustering,Artificial intelligence,Initialization,Cluster analysis,Machine learning,Beijing,Reinforcement learning
Conference
978-1-4503-6201-6
ISBN
Citations 
PageRank 
978-1-4503-6201-6
2
0.38
References 
Authors
0
3
Name
Order
Citations
PageRank
Yexin Li190.92
Yu Zheng28939432.87
Qiang Yang317039875.69