Phansalkar, V. V. ; Thathachar, M. A. L. (1995) Local and global optimization algorithms for generalized learning automata Neural Computation, 7 (5). pp. 950-973. ISSN 0899-7667
Full text not available from this repository.
Official URL: http://portal.acm.org/citation.cfm?id=212723.21273...
Related URL: http://dx.doi.org/10.1162/neco.1995.7.5.950
Abstract
This paper analyzes the long-term behavior of the Reinforce and related algorithms (Williams 1986, 1988, 1992) for generalized learning automata (Narendra and Thathachar 1989) for the associative reinforcement learning problem (Barto and Anandan 1985). The learning system considered here is a feedforward connectionist network of generalized learning automata units. We show that Reinforce is a gradient ascent algorithm but can exhibit unbounded behavior. A modified version of this algorithm, based on constrained optimization techniques, is suggested to overcome this disadvantage. The modified algorithm is shown to exhibit local optimization properties. A global version of the algorithm, based on constant temperature heat bath techniques, is also described and shown to converge to the global maximum. All algorithms are analyzed using weak convergence techniques.
Item Type: | Article |
---|---|
Source: | Copyright of this article belongs to MIT Press. |
ID Code: | 51356 |
Deposited On: | 28 Jul 2011 15:01 |
Last Modified: | 28 Jul 2011 15:01 |
Repository Staff Only: item control page