Bhatnagar, Shalabh ; Abdulla, Mohammed Shahid (2008) Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes Simulation, 84 (12). pp. 577-600. ISSN 0037-5497
Full text not available from this repository.
Official URL: http://doi.org/10.1177/0037549708098120
Related URL: http://dx.doi.org/10.1177/0037549708098120
Abstract
We develop four simulation-based algorithms for finite-horizon Markov decision processes. Two of these algorithms are developed for finite state and compact action spaces while the other two are for finite state and finite action spaces. Of the former two, one algorithm uses a linear parameterization for the policy, resulting in reduced memory complexity. Convergence analysis is briefly sketched and illustrative numerical experiments with the four algorithms are shown for a problem of flow control in communication networks.
Item Type: | Article |
---|---|
Source: | Copyright of this article belongs to Simulation Councils Inc. |
Keywords: | Finite-Horizon Markov Decision Processes; Simulation-Based Algorithms; Two-Timescale Stochastic Approximation; Function Approximation; Actor-Critic Algorithms; Normalized Hadamard Matrices. |
ID Code: | 116554 |
Deposited On: | 12 Apr 2021 06:47 |
Last Modified: | 12 Apr 2021 06:47 |
Repository Staff Only: item control page