Pinned Repositories
A2C-Pytorch-implementations
Implement the A2C(Advantage Actor-Critic) algorithm using pytorch in multiple environments of openai gym. (Including Cartpole, LunarLander, Pong. Breakout is tuning and maybe complete soon.) Sometime implement the REINFORCE algorithm as variations of A2C.
emergent-language
An implementation of Emergence of Grounded Compositional Language in Multi-Agent Populations by Igor Mordatch and Pieter Abbeel
gym
A toolkit for developing and comparing reinforcement learning algorithms.
LIO
A pytorch reproduction of LIO(Learning to Incentivize Other)
Meta-gradient_RL
A toy implementation of paper "Meta-Gradient Reinforcement Learning"
ray_exercise
Some exercises performed during learning ray/tune/rllib
woithook's Repositories
woithook/A2C-Pytorch-implementations
Implement the A2C(Advantage Actor-Critic) algorithm using pytorch in multiple environments of openai gym. (Including Cartpole, LunarLander, Pong. Breakout is tuning and maybe complete soon.) Sometime implement the REINFORCE algorithm as variations of A2C.
woithook/emergent-language
An implementation of Emergence of Grounded Compositional Language in Multi-Agent Populations by Igor Mordatch and Pieter Abbeel
woithook/gym
A toolkit for developing and comparing reinforcement learning algorithms.
woithook/LIO
A pytorch reproduction of LIO(Learning to Incentivize Other)
woithook/Meta-gradient_RL
A toy implementation of paper "Meta-Gradient Reinforcement Learning"
woithook/ray_exercise
Some exercises performed during learning ray/tune/rllib