- a3c:for continous action space, use multi processes, but saving model has not been implemented.
- trpo:for continous and discrete action space
- a3c:run a3c/a3c_continous.py in pycharm IDE
- trpo:run experiment/trpo_continous.py in pycharm IDE
reinfore learning tool box, contains trpo, a3c algorithm for continous action space
Python