Diddigi, Raghuram Bharadwaj ; Reddy, D. Sai Koti ; K.J., Prabuchandran ; Bhatnagar, Shalabh (2019) Actor-Critic Algorithms for Constrained Multi-agent Reinforcement Learning In: 18th International Conference on Autonomous Agents and Multiagent Systems, May 2019, Montreal, QC, Canada.
Full text not available from this repository.
Official URL: https://dl.acm.org/doi/proceedings/10.5555/3306127
Abstract
Multi-agent reinforcement learning has gained lot of popularity primarily owing to the success of deep function approximation architectures. However, many real-life multi-agent applications often impose constraints on the joint action sequence that can be taken by the agents. In this work, we formulate such problems in the framework of constrained cooperative stochastic games. Under this setting, the goal of the agents is to obtain joint action sequence that minimizes a total cost objective criterion subject to total cost penalty/budget functional constraints. To this end, we utilize the Lagrangian formulation and propose actor-critic algorithms. Through experiments on a constrained multi-agent grid world task, we demonstrate that our algorithms converge to near-optimal joint action sequences satisfying the given constraints.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Source: | Copyright of this article belongs to Association for Computing Machinery. |
Keywords: | Theory Of Computation; Theory And Algorithms For Application Domains; Machine Learning Theory Reinforcement Learning; Multi-Agent Reinforcement Learning. |
ID Code: | 116632 |
Deposited On: | 12 Apr 2021 07:15 |
Last Modified: | 12 Apr 2021 07:15 |
Repository Staff Only: item control page