Asynchronous Online Learning in Multi-Agent Systems with Proximity Constraints - Citegraph

Paper Info

Title
Asynchronous Online Learning in Multi-Agent Systems with Proximity Constraints

Abstract
We consider the problem of distributed learning from sequential data via online convex optimization. A multiagent system is considered where each agent has a private objective but is willing to cooperate in order to minimize the network cost, which is the sum of local cost functions. Different from the classical distributed settings, where the agents coordinate through the use of consensus constraints, we allow the neighboring agent actions to be related via a non-linear proximity function. A decentralized saddle point algorithm is proposed that is capable of handling gradient delays arising from computational issues. The proposed online asynchronous algorithm is analyzed under adversarial settings by developing bounds on the regret of $mathcal O (sqrt{T})$ , that measures the cumulative loss incurred by the online algorithm against a clairvoyant, and network discrepancy of $mathcal O (T^{3/4})$ , that measures the cumulative constraint violation or agent disagreement. By allowing the agents to utilize stale gradient information, the proposed algorithm embraces the nuances of distributed learning, and serves to be the first distributed online algorithm that can handle adversarial delays. A modified saddle point algorithm is also proposed that explicitly forces the agents to agree as per the constraint function resulting in zero network discrepancy, while incurring a slightly higher regret. To showcase the efficacy of the proposed asynchronous algorithm, a spatially correlated random field estimation problem is formulated and solved. Additionally, an application of vision based target localization with moving cameras demonstrates the benefits of this approach in practice.

Year	DOI	Venue
2019	10.1109/tsipn.2019.2902493	ieee transactions on signal and information processing over networks
Keywords	Field	DocType
Delays,Optimization,Convex functions,Correlation,Linear programming,Information processing,Multi-agent systems	Online algorithm,Asynchronous communication,Mathematical optimization,Saddle point,Regret,Computer science,Multi-agent system,Convex function,Linear programming,Convex optimization	Journal
Volume	Issue	ISSN
5	3	2373-776X
Citations	PageRank	References
1	0.35	0
Authors
3

Authors (3 rows)

Cited by (1 rows)

References (0 rows)

Name	Order	Citations	PageRank
Amrit Singh Bedi	1	16	9.43
Alec Koppel	2	99	21.66
Ketan Rajawat	3	124	25.44

1