Title
Behavior Reasoning for Opponent Agents in Multi-Agent Learning Systems
Abstract
One important component of developing autonomous agents lies in the accurate prediction of their opponents’ behaviors when the agents interact with others in an uncertain environment. Most recent study focuses on first constructing predictive types (or models) of the opponents, considering their various properties of interest, and subsequently using these models to predict their behaviors accordingly. However, as the possible type space can be rather large, it is time-consuming, and sometimes even infeasible, to predict the actual behaviors of opponents with all candidate types. Thus, in this paper a tractable opponent behavior reasoning approach is proposed that facilitates ( <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$a$</tex-math></inline-formula> ) extraction of a small yet representative summary of all candidates using sub-modular-type maximization, and accordingly, ( <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$b$</tex-math></inline-formula> ) identification of the most appropriate type for real-time behavior prediction based on multi-armed bandits. In addition, we propose a knowledge-transfer scheme through demonstration learning to synchronize subject agents’ knowledge about their opponents’ behaviors. This further reduces the burden of reasoning with all models of their opponents from the perspective of individual subject agents. We integrate the new behavior prediction and reasoning method into a state-of-the-art evolutionary multi-agent framework, namely a memetic multi-agent system (MeMAS), and demonstrate empirical performance in two problem domains.
Year
DOI
Venue
2022
10.1109/TETCI.2022.3147011
IEEE Transactions on Emerging Topics in Computational Intelligence
Keywords
DocType
Volume
Opponent modeling,multi-agent systems,behavior prediction and reasoning,memetic computing
Journal
6
Issue
ISSN
Citations 
5
2471-285X
0
PageRank 
References 
Authors
0.34
20
7
Name
Order
Citations
PageRank
Yaqing Hou100.34
Mingyang Sun200.34
Wenxuan Zhu300.34
Yifeng Zeng441543.27
Haiyin Piao500.34
Xuefeng Chen6394.55
Qiang Zhang729245.54