Distributed Reinforcement Learning via Gossip

Dimensions

Mathkar, Adwaitvedant ; Borkar, Vivek S. (2017) Distributed Reinforcement Learning via Gossip IEEE Transactions on Automatic Control, 62 (3). pp. 1465-1470. ISSN 0018-9286

Full text not available from this repository.

Official URL: http://doi.org/10.1109/TAC.2016.2585302

Related URL: http://dx.doi.org/10.1109/TAC.2016.2585302

Abstract

We consider the classical TD(0) algorithm implemented on a network of agents wherein the agents also incorporate updates received from neighboring agents using a gossip-like mechanism. The combined scheme is shown to converge for both discounted and average cost problems.

Item Type:	Article
Source:	Copyright of this article belongs to Institute of Electrical and Electronic Engineers.
ID Code:	135167
Deposited On:	19 Jan 2023 11:26
Last Modified:	19 Jan 2023 11:26

Repository Staff Only: item control page