This is a reinforcement learning pipeline for intelligent traffic signal control. Usage and more information can be found below.
We recommend to run the code through docker. Some brief documentation can be found at https://docs.docker.com/.
-
Please pull the docker image from the docker hub.
docker pull gjzheng/frap:latest
-
Please run the built docker image to initiate a docker container. Please remember to mount the code directory.
docker run -it -v path/to/the/code/repo/:/work/ simulator-test bash
cd /work/
bash example_run.sh
Start an example:
sh example_run.sh
In this script, it runs the file run_batch.py
. Here are some important arguments that can be modified for different experiments:
- memo: the memo name of the experiment
- algorithm: the specified algorithm, e.g., TransferDQN.
- num_phase: phase setting for the experiment, e.g., 2, 4, or 8. Default is 8. Other settings may need extra data.
Hyperparameters such as learning rate, sample size and the like for the agent can also be assigned in our code and they are easy to tune for better performance.
-
agent.py
A abstract class of different agents.
-
network_agent.py
A abstract class of neural network based agents. All methods are defined in this file but
build_network()
, which means onlybuild_network()
is necessary in specific network agents. -
transfer_dqn_agent.py
This is our proposed framework which can handle all-phase scenarios and achieve invariance to symmetrical transformation like flipping and rotation.
More details about this project are demonstrated in this part.
-
config.py
The whole configuration of this project. Note that some parameters will be replaced in
runexp.py
while others can only be changed in this file, please be very careful!!! -
pipeline.py
The whole pipeline is implemented in this module:
Start a SUMO environment, run a simulation for certain time(one round), construct samples from raw log data, update the model and model pooling.
-
generator.py
A generator to load a model, start a SUMO enviroment, conduct a simulation and log the results.
-
sumo_env.py
Define a SUMO environment to interact with SUMO and obtain needed data like features.
-
anon_env.py
Implement a multi-process version of SUMO.
-
construct_sample.py
Construct training samples from original data. Select desired state features in the config and compute the corrsponding average/instant reward with specific measure time.
-
updater.py
Define a class of updater for model updating.