Military Units Optimal Formation in Wargame Simulation Using Multiagent Reinforcment Learning

Document Type : Original Article

Authors

1 AmirKabir Uni

2 Amirkabir Uni

Abstract

In this paper, multiagent reinforcement learning has been used in wargame simulation. Military organization causes priorities in actions taken among agents in the feild. Decision making among n-person generalsum multiagent systems with consecutive action selection can be modeled through extensive form games. The existing process of learning can be considered as a set of successive extensive form games with Markov property. We use subgame perfect equilibrium points as optimal joint action using the well-known backward induction algorithm. An estimation of possible utilities gained over a game with respect to other agents’ preferences, called as associative Q-values, is used to predict optimal action. Simulation results present the effectiveness of the proposed algorithms in wargame simulation.

Keywords