He, Ying ; Bhatnagar, Shalabh ; Fu, Michael C. ; Marcus, Steven I. (2000) Approximate Policy Iteration for Semiconductor Fab-Level Decision Making - a Case Study Institute for Systems Research Technical Reports .
Full text not available from this repository.
Related URL: http://dx.doi.org/https://drum.lib.umd.edu/bitstream/handle/1903/6142/TR_2000-49.pdf?sequence=1&isAllowed=y
Abstract
In this paper, we propose an approximate policy iteration (API) algorithm for asemiconductor fab-level decision making problem. This problem is formulated as adiscounted cost Markov Decision Process (MDP), and we have applied exact policy iterationto solve a simple example in prior work. However, the overwhelmingcomputational requirements of exact policy iteration prevent its application forlarger problems. Approximate policy iteration overcomes this obstacle by approximating thecost-to-go using function approximation. Numerical simulation on the same example showsthat the proposed API algorithm leads to a policy with cost close to that of the optimalpolicy.
Item Type: | Article |
---|---|
Keywords: | Approximate Policy Iteration, Semiconductor Fab-Level Decision Making, Markov Decision Processes, Discounted Cost Problem. |
ID Code: | 116600 |
Deposited On: | 12 Apr 2021 07:08 |
Last Modified: | 12 Apr 2021 07:08 |
Repository Staff Only: item control page