ikostrikov

Post doc

UC BerkeleyBerkeley

Pinned Repositories

drq
DrQ: Data regularized Q
Language:Jupyter Notebook401 13 2652
implicit_q_learning
Language:Python226 5 938
jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language:Jupyter Notebook608 12 865
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.6k 67 229829
pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Language:Python1.2k 43 67280
pytorch-ddpg-naf
Implementation of algorithms for continuous control (DDPG and NAF).
Language:Python307 10 972
pytorch-flows
PyTorch implementations of algorithms for density estimation
Language:Python571 18 875
pytorch-meta-optimizer
A PyTorch implementation of Learning to learn by gradient descent by gradient descent
Language:Python309 16 1056
pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
Language:Python431 13 2091
TensorFlow-VAE-GAN-DRAW
A collection of generative methods implemented with TensorFlow (Deep Convolutional Generative Adversarial Networks (DCGAN), Variational Autoencoder (VAE) and DRAW: A Recurrent Neural Network For Image Generation).
Language:Python595 32 14167

ikostrikov's Repositories

ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.6k 67 229829
ikostrikov/pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Language:Python1.2k 43 67280
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language:Jupyter Notebook608 12 865
ikostrikov/pytorch-flows
PyTorch implementations of algorithms for density estimation
Language:Python571 18 875
ikostrikov/pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
Language:Python431 13 2091
ikostrikov/pytorch-meta-optimizer
A PyTorch implementation of Learning to learn by gradient descent by gradient descent
Language:Python309 16 1056
ikostrikov/pytorch-ddpg-naf
Implementation of algorithms for continuous control (DDPG and NAF).
Language:Python307 10 972
ikostrikov/walk_in_the_park
Language:Python243 12 535
ikostrikov/implicit_q_learning
Language:Python226 5 938
ikostrikov/TensorFlow-Pointer-Networks
TensorFlow implementation of Pointer Networks
Language:Python203 12 1068
ikostrikov/rlpd
Language:Python201 4 723
ikostrikov/pytorch-rl
57 4 010
ikostrikov/jaxrl2
Language:Jupyter Notebook41 5 215
ikostrikov/dmcgym
Language:Python23 3 117
ikostrikov/linenplus
Flax extensions.
Language:Python5 6 0
ikostrikov/cql-results
Language:Python3 3 11
ikostrikov/gail-experts
3 4 11
ikostrikov/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python3 2 0
ikostrikov/motion_imitation
Code accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"
Language:Python2 2 0
ikostrikov/doodad
Language:Python1 2 0
ikostrikov/Implicit-Q-Learning
PyTorch implementation of the implicit Q-learning algorithm (IQL)
Language:Python1 2 0
ikostrikov/mazelab
A customizable framework to create maze and gridworld environments
Language:Python1 2 0
ikostrikov/Mine_tf2.0
MINE: Mutual Information Neural Estimation in pytorch
Language:Jupyter Notebook1 4 01
ikostrikov/roboverse
A set of environments utilizing pybullet for simulation of robotic manipulation tasks.
Language:Python1 2 0
ikostrikov/unitree_sim
MuJoCo models for Unitree Robots
1 2 0
ikostrikov/d4rl
A benchmark for offline reinforcement learning.
Language:Python2 0
ikostrikov/gym-wordle
Gym environment for playing Wordle with RL agents
Language:Python2 0
ikostrikov/oatomobile
A research framework for autonomous driving
Language:Python2 0
ikostrikov/obj_2_mujoco_msh
Language:Python2 0
ikostrikov/SMAAC
This repo contains the code of "Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic".
Language:Python2 0