A3C-CarRacing

An agent trained using A3C to play openai gym CarRacing-v0, which can get as high as 900+ score

I have trained the model on 4 azure vCPUs one week and store the weights and logs in bak
the agent can get average 870+ scores
the rewards curve is shown below, where I estimate human level and max score by 750 and 950 for compare

I have define 9 actions for AI, which are:

More actions can make the agent gain higher score and drive more smoothly.

different from other related repositories, I sample action by probability even when testing instead of using argmax, because I find it can make the agent do better. I argue that it's because when considering more than one frames and average the actions, the agent actually can do more continuous action.

python 3.x
pytorch 0.4.0
pyvirtualdisplay (if you want to train the model on a server which don't have a monitor)
To get more suitable window size, I modify car_racing.py in installation folder (for example site-packages\gym\envs\box2d), which is
```
line 46: WINDOW_W = 1200  ->  WINDOW_W = 900 
line 47: WINDOW_H = 1000  ->  WINDOW_H = 750
```
you are suggested to do the same, otherwise the agent may can't adapt to the scale change and get lower score

chenhang98/A3C-CarRacing