L.A., Prashanth ; H.L., Prasad ; Bhatnagar, Shalabh ; Chandra, Prakash (2016) A constrained optimization perspective on actor–critic algorithms and application to network routing Systems & Control Letters, 92 . pp. 46-51. ISSN 01676911
Full text not available from this repository.
Official URL: http://doi.org/10.1016/j.sysconle.2016.02.020
Related URL: http://dx.doi.org/10.1016/j.sysconle.2016.02.020
Abstract
We propose a novel actor–critic algorithm with guaranteed convergence to an optimal policy for a discounted reward Markov decision process. The actor incorporates a descent direction that is motivated by the solution of a certain non-linear optimization problem. We also discuss an extension to incorporate function approximation and demonstrate the practicality of our algorithms on a network routing application.
Item Type: | Article |
---|---|
Source: | Copyright of this article belongs to Elsevier B.V. |
Keywords: | Actor-Critic Algorithm; Reinforcement Learning; Constrained Optimization. |
ID Code: | 116492 |
Deposited On: | 12 Apr 2021 06:01 |
Last Modified: | 12 Apr 2021 06:01 |
Repository Staff Only: item control page