Prashanth, L A ; Bhatnagar, Shalabh (2011) Reinforcement learning with average cost for adaptive control of traffic lights at intersections In: 14th International IEEE Conference on Intelligent Transportation Systems (ITSC), 5-7 Oct. 2011, Washington, DC, USA.
Full text not available from this repository.
Official URL: http://doi.org/10.1109/ITSC.2011.6082823
Related URL: http://dx.doi.org/10.1109/ITSC.2011.6082823
Abstract
We propose for the first time two reinforcement learning algorithms with function approximation for average cost adaptive control of traffic lights. One of these algorithms is a version of Q-learning with function approximation while the other is a policy gradient actor-critic algorithm that incorporates multi-timescale stochastic approximation. We show performance comparisons on various network settings of these algorithms with a range of fixed timing algorithms, as well as a Q-learning algorithm with full state representation that we also implement. We observe that whereas (as expected) on a two-junction corridor, the full state representation algorithm shows the best results, this algorithm is not implementable on larger road networks. The algorithm PG-AC-TLC that we propose is seen to show the best overall performance.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Source: | Copyright of this article belongs to Institute of Electrical and Electronics Engineers. |
Keywords: | Traffic Signal Control; Reinforcement Learning; Q-Learning; Policy Gradient Actor-Critic. |
ID Code: | 116682 |
Deposited On: | 12 Apr 2021 07:23 |
Last Modified: | 12 Apr 2021 07:23 |
Repository Staff Only: item control page