Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes

Dimensions

Bhatnagar, Shalabh ; Abdulla, Mohammed Shahid (2008) Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes Simulation, 84 (12). pp. 577-600. ISSN 0037-5497

Full text not available from this repository.

Official URL: http://doi.org/10.1177/0037549708098120

Related URL: http://dx.doi.org/10.1177/0037549708098120

Abstract

We develop four simulation-based algorithms for finite-horizon Markov decision processes. Two of these algorithms are developed for finite state and compact action spaces while the other two are for finite state and finite action spaces. Of the former two, one algorithm uses a linear parameterization for the policy, resulting in reduced memory complexity. Convergence analysis is briefly sketched and illustrative numerical experiments with the four algorithms are shown for a problem of flow control in communication networks.

Item Type:	Article
Source:	Copyright of this article belongs to Simulation Councils Inc.
Keywords:	Finite-Horizon Markov Decision Processes; Simulation-based Algorithms; Two-Timescale Stochastic Approximation, Function Approximation; Actor-Critic Algorithms; Normalized Hadamard Matrices.
ID Code:	116599
Deposited On:	12 Apr 2021 07:08
Last Modified:	12 Apr 2021 07:08

Repository Staff Only: item control page

PlumX Metrics