ReLight: Capturing spatial-temporal context in Road Traffic Signal Control using recurrency in POMDPs

Authors

Kjær, Peter ; Andersen, Samuel Alexander Vall

Term

4. term

Education

Software, Master

Publication year

2022

Submitted on

2022-06-10

Pages

Abstract

Traffic congestion in urban areas is a problem for the environment and the economy. One solution to minimize congestion is optimizing traffic lights. Traffic signal control is a challenging problem due to the complex traffic flow patterns. Conventional traffic control use pre-coded cycle pattern plans, which suffer from adapting to the complex flow dynamics. Reinforcement Learning allows for dynamic control but is unable to properly catch temporal-feature due to the Markov property. To solve this, recent papers propose incorporating prediction modules into Reinforcement Learning control, however, this suffers from additional loss and generalization. To circumvent these issues, we propose Recurrent Light (ReLight), which treats the environment as a Partially Observable Markov Decision Process which depends on the history of previous belief states. We utilize this dependency to capture spatial-temporal features and utilize an LSTM in the DQN network to capture important long-short term features through hidden states. To properly capture cycle phases, we propose two sampling and two training strategies. In our experiments, we demonstrate that ReLight outperforms state-of-the-art models on one, multi and city-wide datasets.

Keywords

reinforcement learning

Documents

Download
View record in AAU Student Projects

A master's thesis from Aalborg University

ReLight: Capturing spatial-temporal context in Road Traffic Signal Control using recurrency in POMDPs