Pinned Repositories
drq
DrQ: Data regularized Q
implicit_q_learning
jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
pytorch-ddpg-naf
Implementation of algorithms for continuous control (DDPG and NAF).
pytorch-flows
PyTorch implementations of algorithms for density estimation
pytorch-meta-optimizer
A PyTorch implementation of Learning to learn by gradient descent by gradient descent
pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
TensorFlow-VAE-GAN-DRAW
A collection of generative methods implemented with TensorFlow (Deep Convolutional Generative Adversarial Networks (DCGAN), Variational Autoencoder (VAE) and DRAW: A Recurrent Neural Network For Image Generation).
ikostrikov's Repositories
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
ikostrikov/pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
ikostrikov/pytorch-flows
PyTorch implementations of algorithms for density estimation
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
ikostrikov/pytorch-meta-optimizer
A PyTorch implementation of Learning to learn by gradient descent by gradient descent
ikostrikov/pytorch-ddpg-naf
Implementation of algorithms for continuous control (DDPG and NAF).
ikostrikov/walk_in_the_park
ikostrikov/implicit_q_learning
ikostrikov/TensorFlow-Pointer-Networks
TensorFlow implementation of Pointer Networks
ikostrikov/rlpd
ikostrikov/pytorch-rl
ikostrikov/jaxrl2
ikostrikov/dmcgym
ikostrikov/linenplus
Flax extensions.
ikostrikov/cql-results
ikostrikov/gail-experts
ikostrikov/gym
A toolkit for developing and comparing reinforcement learning algorithms.
ikostrikov/motion_imitation
Code accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"
ikostrikov/doodad
ikostrikov/Implicit-Q-Learning
PyTorch implementation of the implicit Q-learning algorithm (IQL)
ikostrikov/mazelab
A customizable framework to create maze and gridworld environments
ikostrikov/Mine_tf2.0
MINE: Mutual Information Neural Estimation in pytorch
ikostrikov/roboverse
A set of environments utilizing pybullet for simulation of robotic manipulation tasks.
ikostrikov/unitree_sim
MuJoCo models for Unitree Robots
ikostrikov/d4rl
A benchmark for offline reinforcement learning.
ikostrikov/gym-wordle
Gym environment for playing Wordle with RL agents
ikostrikov/oatomobile
A research framework for autonomous driving
ikostrikov/obj_2_mujoco_msh
ikostrikov/SMAAC
This repo contains the code of "Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic".