naokiuchida's Stars
reinforcement-learning-kr/pg_travel
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
l5shi/Multi-DDPG-with-parameter-noise
New reinforcement algorithm base on DDPG
mynameisfiber/high_performance_python
Code for the book "High Performance Python" by Micha Gorelick and Ian Ozsvald with OReilly
jnishii/python-rl-introduction
fastai/fastai
The fastai deep learning library
matsuolab-edu/dl4us
CMA-ES/pycma
Python implementation of CMA-ES
NotAnyMike/gym
An improvement of CarRacing-v0 from OpenAI Gym in order to make the environment complex enough for Hierarchical Reinforcement Learning
gui-miotto/DeepLearningLab
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
icoxfog417/baby-steps-of-rl-ja
Pythonで学ぶ強化学習 -入門から実践まで- サンプルコード
xl-sr/CAL
[CoRL'18] Conditional Affordance Learning
avisingh599/reward-learning-rl
[RSS 2019] End-to-End Robotic Reinforcement Learning without Reward Engineering
tejus-gupta/hybrid-astar-planner
Hybrid A* Path Planner
karlkurzer/path_planner
Hybrid A* Path Planner for the KTH Research Concept Vehicle
MLCS-Yonsei/Factory_RL_Gazebo
Youbot simulator (SLAM & Navi)