An autonomous car simulation environment for RL agents.
- Clone this repository.
$ git clone https://github.com/Jueun-Park/gym-autonmscar.git
- Before installing `stable-baselines`, please refer to the stable-baselines README and install the prerequisites.
- `cd` into `gym-autonmscar`, then install the requirements.
$ pip install -e .
You can use `virtualenv` to isolate the training environment. The Python version used in this project is 3.6.8.
Train the model using the test code. The test code uses DQN as implemented in `stable-baselines`.
$ python test/train_test.py
$ python test/load_test.py
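The test scripts themselves live in the repository; as a rough orientation, DQN training with `stable-baselines` on this environment might look like the sketch below (the hyperparameters, number of timesteps, and save path are assumptions, not the repo's actual settings):

```python
import gym
import gym_autonmscar
from stable_baselines import DQN
from stable_baselines.deepq.policies import MlpPolicy

# Discrete-action version of the environment
env = gym.make('autonmscar-v0')

# Hyperparameters and paths below are illustrative assumptions
model = DQN(MlpPolicy, env, verbose=1, tensorboard_log="./dqn_tensorboard/")
model.learn(total_timesteps=100000)
model.save("dqn_autonmscar")

# Reload the saved model and run one episode
model = DQN.load("dqn_autonmscar")
obs = env.reset()
done = False
while not done:
    action, _states = model.predict(obs)
    obs, reward, done, info = env.step(action)
    env.render()
env.close()
```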
You can check the training graphs using TensorBoard.
$ tensorboard --logdir ./dqn_tensorboard/
Import `imageio` and render the environment with the `'rgb_array'` mode. `imageio.mimsave()` will make an animated GIF file of the training result. You can see the example code in `train/gif.py`.
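A minimal sketch of that approach, assuming the standard gym rendering API (the file name, frame count, and fps are illustrative, not the values used in `train/gif.py`):

```python
import gym
import gym_autonmscar
import imageio

env = gym.make('autonmscar-v0')
frames = []

obs = env.reset()
for _ in range(300):  # frame count is illustrative
    # 'rgb_array' mode returns the frame as a numpy array instead of drawing a window
    frames.append(env.render(mode='rgb_array'))
    obs, reward, done, info = env.step(env.action_space.sample())
    if done:
        obs = env.reset()
env.close()

# Write the collected frames as an animated GIF
imageio.mimsave('training_result.gif', frames, fps=30)
```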
[DDPG training result example]
The observation space has 363 dimensions (see the sketch after the list below).
- 360 dimensions of LiDAR data; the range of each value is 0 - 100.
  - Each value is the distance to the nearest obstacle, sampled every 0.5 degrees. The angle ranges from 0 to 180 degrees, and the front of the car is at 90 degrees.
- 2 dimensions for the position of the car; the range is the size of the map.
- 1 dimension for the speed of the car; the range is 1 - 10.
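A minimal sketch of splitting the observation into those parts; the ordering of the vector (LiDAR, then position, then speed) is an assumption, not something stated above:

```python
import gym
import gym_autonmscar

env = gym.make('autonmscar-v0')
obs = env.reset()
print(env.observation_space.shape)  # expected: (363,)

# Assumed layout: 360 LiDAR readings, then (x, y) position, then speed
lidar = obs[:360]        # distances to obstacles, 0 - 100, one reading per 0.5 degrees
position = obs[360:362]  # (x, y) position of the car on the map
speed = obs[362]         # speed of the car, 1 - 10
```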
The action space has 4 dimensions.
- up, down, left, right

In the discrete environment (e.g. for DQN), the agent has to choose one of the four actions at each time step.
import gym
import gym_autonmscar
env = gym.make('autonmscar-v0')
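For orientation, the discrete action space presumably follows the standard gym `Discrete` interface; the snippet below is a sketch under that assumption:

```python
print(env.action_space)  # expected: Discrete(4) -- up, down, left, right

# Take one step with a randomly sampled discrete action (0, 1, 2, or 3)
obs = env.reset()
obs, reward, done, info = env.step(env.action_space.sample())
```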
In the continuous environment (e.g. for DDPG), the agent outputs four continuous values in the range [-1, 1] at each time step; the environment then converts the four values into the probabilities of choosing one of the four actions.
import gym
import gym_autonmscar
env = gym.make('autonmscarContinuous-v0')
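A sketch of stepping the continuous environment, assuming a standard gym `Box` action space bounded by [-1, 1] with 4 dimensions:

```python
import numpy as np

print(env.action_space)  # expected: a 4-dimensional Box bounded by [-1, 1]

# Four continuous values; the environment converts them into the probabilities
# of choosing one of the four discrete actions (per the description above)
obs = env.reset()
action = np.random.uniform(-1.0, 1.0, size=4)
obs, reward, done, info = env.step(action)
```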
- When colliding with a wall: `reward -= 10`
- When getting the trophy: `reward += 100 / [seconds after start] + 1000`
- Otherwise: `reward += [speed of the car]`
  - If the speed of the car is 0, then `reward = -1`
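Pieced together, the reward rules above could be sketched as the following function (a paraphrase for readability, not the environment's actual code; the names are made up):

```python
def compute_reward(collided_with_wall, got_trophy, seconds_after_start, speed):
    """Sketch of the reward rules listed above; not the environment's actual code."""
    reward = 0
    if collided_with_wall:
        reward -= 10
    if got_trophy:
        reward += 100 / seconds_after_start + 1000
    else:
        reward += speed  # reward the car's speed
        if speed == 0:
            reward = -1  # standing still is penalized
    return reward
```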