Title
Asynchronous Online Learning in Multi-Agent Systems with Proximity Constraints
Abstract
We consider the problem of distributed learning from sequential data via online convex optimization. A multiagent system is considered where each agent has a private objective but is willing to cooperate in order to minimize the network cost, which is the sum of local cost functions. Different from the classical distributed settings, where the agents coordinate through the use of consensus constraints, we allow the neighboring agent actions to be related via a non-linear proximity function. A decentralized saddle point algorithm is proposed that is capable of handling gradient delays arising from computational issues. The proposed online asynchronous algorithm is analyzed under adversarial settings by developing bounds on the regret of $mathcal O (sqrt{T})$ , that measures the cumulative loss incurred by the online algorithm against a clairvoyant, and network discrepancy of $mathcal O (T^{3/4})$ , that measures the cumulative constraint violation or agent disagreement. By allowing the agents to utilize stale gradient information, the proposed algorithm embraces the nuances of distributed learning, and serves to be the first distributed online algorithm that can handle adversarial delays. A modified saddle point algorithm is also proposed that explicitly forces the agents to agree as per the constraint function resulting in zero network discrepancy, while incurring a slightly higher regret. To showcase the efficacy of the proposed asynchronous algorithm, a spatially correlated random field estimation problem is formulated and solved. Additionally, an application of vision based target localization with moving cameras demonstrates the benefits of this approach in practice.
Year
DOI
Venue
2019
10.1109/tsipn.2019.2902493
ieee transactions on signal and information processing over networks
Keywords
Field
DocType
Delays,Optimization,Convex functions,Correlation,Linear programming,Information processing,Multi-agent systems
Online algorithm,Asynchronous communication,Mathematical optimization,Saddle point,Regret,Computer science,Multi-agent system,Convex function,Linear programming,Convex optimization
Journal
Volume
Issue
ISSN
5
3
2373-776X
Citations 
PageRank 
References 
1
0.35
0
Authors
3
Name
Order
Citations
PageRank
Amrit Singh Bedi1169.43
Alec Koppel29921.66
Ketan Rajawat312425.44