Borkar, V. S. (2001) A sensitivity formula for risk-sensitive cost and the actor-critic algorithm Systems & Control Letters, 44 (5). pp. 339-346. ISSN 0167-6911
Full text not available from this repository.
Official URL: http://linkinghub.elsevier.com/retrieve/pii/S01676...
Related URL: http://dx.doi.org/10.1016/S0167-6911(01)00152-9
Abstract
We propose for risk sensitive control of finite Markov chains a counterpart of the popular 'actor-critic'algorithm for classical Markov decision processes. The algorithm is based on a 'sensitivity formula'for the risk sensitive cost; is shown to converge with probability one to the desired solution. The proof technique is an adaptation of the ordinary differential equations approach for the analysis of two time-scale stochastic approximation algorithms.
Item Type: | Article |
---|---|
Source: | Copyright of this article belongs to Elsevier Science. |
Keywords: | Markov Decision Processes; Risk Sensitive Control; Reinforcement Learning; Actor-critic Algorithms; Parametric Sensitivity; Stochastic Approximation |
ID Code: | 5283 |
Deposited On: | 18 Oct 2010 08:32 |
Last Modified: | 20 May 2011 09:04 |
Repository Staff Only: item control page