A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions

Arapostathis, Ari ; Borkar, Vivek S. (2012) A Relative Value Iteration Algorithm for Nondegenerate Controlled Diffusions SIAM Journal on Control and Optimization, 50 (4). pp. 1886-1902. ISSN 0363-0129

Full text not available from this repository.

Official URL: http://doi.org/10.1137/110850529

Related URL: http://dx.doi.org/10.1137/110850529

Abstract

The ergodic control problem for a nondegenerate diffusion controlled through its drift is considered under a uniform stability condition that ensures the well-posedness of the associated Hamilton--Jacobi--Bellman (HJB) equation. A nonlinear parabolic evolution equation is then proposed as a continuous time, continuous state space analogue of White's relative value iteration algorithm for solving the ergodic dynamic programming equation for the finite state, finite action case. Its convergence to the solution of the HJB equation is established using the theory of monotone dynamical systems and also, alternatively, by using the theory of reverse martingales.

Item Type:Article
Source:Copyright of this article belongs to Society for Industrial & Applied Mathematics.
ID Code:135231
Deposited On:20 Jan 2023 07:14
Last Modified:20 Jan 2023 07:14

Repository Staff Only: item control page