Bhatnagar, Shalabh (2011) The Borkar–Meyn theorem for asynchronous stochastic approximations Systems & Control Letters, 60 (7). pp. 472-478. ISSN 0167-6911
Full text not available from this repository.
Official URL: http://doi.org/10.1016/j.sysconle.2011.04.002
Related URL: http://dx.doi.org/10.1016/j.sysconle.2011.04.002
Abstract
In this paper, we give a generalization of a result by Borkar and Meyn (2000) [1], on the stability and convergence of synchronous-update stochastic approximation algorithms, to the case of asynchronous stochastic approximations with delays. We then describe an interesting application of the result to asynchronous distributed temporal difference (TD) learning with function approximation and delays.
Item Type: | Article |
---|---|
Source: | Copyright of this article belongs to Elsevier B.V. |
Keywords: | The Borkar–Meyn Theorem; Asynchronous Stochastic Approximation With Delays; Temporal Difference Learning. |
ID Code: | 116538 |
Deposited On: | 12 Apr 2021 06:46 |
Last Modified: | 12 Apr 2021 06:46 |
Repository Staff Only: item control page