Mathkar, Adwaitvedant ; Borkar, Vivek S. (2017) Distributed Reinforcement Learning via Gossip IEEE Transactions on Automatic Control, 62 (3). pp. 1465-1470. ISSN 0018-9286
Full text not available from this repository.
Official URL: http://doi.org/10.1109/TAC.2016.2585302
Related URL: http://dx.doi.org/10.1109/TAC.2016.2585302
Abstract
We consider the classical TD(0) algorithm implemented on a network of agents wherein the agents also incorporate updates received from neighboring agents using a gossip-like mechanism. The combined scheme is shown to converge for both discounted and average cost problems.
Item Type: | Article |
---|---|
Source: | Copyright of this article belongs to Institute of Electrical and Electronic Engineers. |
ID Code: | 135167 |
Deposited On: | 19 Jan 2023 11:26 |
Last Modified: | 19 Jan 2023 11:26 |
Repository Staff Only: item control page