A sensitivity formula for risk-sensitive cost and the actor-critic algorithm

Borkar, V. S. (2001) A sensitivity formula for risk-sensitive cost and the actor-critic algorithm Systems & Control Letters, 44 (5). pp. 339-346. ISSN 0167-6911

Full text not available from this repository.

Official URL: http://linkinghub.elsevier.com/retrieve/pii/S01676...

Related URL: http://dx.doi.org/10.1016/S0167-6911(01)00152-9

Abstract

We propose for risk sensitive control of finite Markov chains a counterpart of the popular 'actor-critic'algorithm for classical Markov decision processes. The algorithm is based on a 'sensitivity formula'for the risk sensitive cost; is shown to converge with probability one to the desired solution. The proof technique is an adaptation of the ordinary differential equations approach for the analysis of two time-scale stochastic approximation algorithms.

Item Type:Article
Source:Copyright of this article belongs to Elsevier Science.
Keywords:Markov Decision Processes; Risk Sensitive Control; Reinforcement Learning; Actor-critic Algorithms; Parametric Sensitivity; Stochastic Approximation
ID Code:5283
Deposited On:18 Oct 2010 08:32
Last Modified:20 May 2011 09:04

Repository Staff Only: item control page