woithook

Pinned Repositories

A2C-Pytorch-implementations
Implement the A2C(Advantage Actor-Critic) algorithm using pytorch in multiple environments of openai gym. (Including Cartpole, LunarLander, Pong. Breakout is tuning and maybe complete soon.) Sometime implement the REINFORCE algorithm as variations of A2C.
Language:Python11
emergent-language
An implementation of Emergence of Grounded Compositional Language in Multi-Agent Populations by Igor Mordatch and Pieter Abbeel
Language:Python00
gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python0 0 00
LIO
A pytorch reproduction of LIO(Learning to Incentivize Other)
Language:Python0 1 00
Meta-gradient_RL
A toy implementation of paper "Meta-Gradient Reinforcement Learning"
Language:Jupyter Notebook00
ray_exercise
Some exercises performed during learning ray/tune/rllib
Language:Python00

woithook's Repositories

woithook/A2C-Pytorch-implementations
Implement the A2C(Advantage Actor-Critic) algorithm using pytorch in multiple environments of openai gym. (Including Cartpole, LunarLander, Pong. Breakout is tuning and maybe complete soon.) Sometime implement the REINFORCE algorithm as variations of A2C.
Language:Python11
woithook/emergent-language
An implementation of Emergence of Grounded Compositional Language in Multi-Agent Populations by Igor Mordatch and Pieter Abbeel
Language:Python00
woithook/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python0 0 00
woithook/LIO
A pytorch reproduction of LIO(Learning to Incentivize Other)
Language:Python0 1 00
woithook/Meta-gradient_RL
A toy implementation of paper "Meta-Gradient Reinforcement Learning"
Language:Jupyter Notebook00
woithook/ray_exercise
Some exercises performed during learning ray/tune/rllib
Language:Python00