A constrained optimization perspective on actor–critic algorithms and application to network routing

L.A., Prashanth ; H.L., Prasad ; Bhatnagar, Shalabh ; Chandra, Prakash (2016) A constrained optimization perspective on actor–critic algorithms and application to network routing Systems & Control Letters, 92 . pp. 46-51. ISSN 01676911

Full text not available from this repository.

Official URL: http://doi.org/10.1016/j.sysconle.2016.02.020

Related URL: http://dx.doi.org/10.1016/j.sysconle.2016.02.020

Abstract

We propose a novel actor–critic algorithm with guaranteed convergence to an optimal policy for a discounted reward Markov decision process. The actor incorporates a descent direction that is motivated by the solution of a certain non-linear optimization problem. We also discuss an extension to incorporate function approximation and demonstrate the practicality of our algorithms on a network routing application.

Item Type:Article
Source:Copyright of this article belongs to Elsevier B.V.
Keywords:Actor-Critic Algorithm; Reinforcement Learning; Constrained Optimization.
ID Code:116492
Deposited On:12 Apr 2021 06:01
Last Modified:12 Apr 2021 06:01

Repository Staff Only: item control page