Pinned Repositories
ad-eps
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
headless-ad
Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"
xland-minigrid
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
guided-es-by-differentiable-simulators
Guiding evolutionary strategies by (inaccurate) differentiable robot simulators @ NeurIPS, 4th Robot Learning Workshop
haxball-chameleon
Solving Haxball (www.haxball.com) using Imitation Learning methods.
hierarchical-skill-acquisition
Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong, and Richard Socher
language-grounding-multigoal
An accompanying code and experiments' results for Task-Oriented Language Grounding for Language Input with Multiple Sub-Goals of Non-Linear Order
vkurenkov's Repositories
vkurenkov/haxball-chameleon
Solving Haxball (www.haxball.com) using Imitation Learning methods.
vkurenkov/hierarchical-skill-acquisition
Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong, and Richard Socher
vkurenkov/language-grounding-multigoal
An accompanying code and experiments' results for Task-Oriented Language Grounding for Language Input with Multiple Sub-Goals of Non-Linear Order
vkurenkov/guided-es-by-differentiable-simulators
Guiding evolutionary strategies by (inaccurate) differentiable robot simulators @ NeurIPS, 4th Robot Learning Workshop
vkurenkov/cem-tetris
Solving Tetris using Cross-Entropy Method
vkurenkov/graphy-db
Graph Database
vkurenkov/tensegrity
vkurenkov/bcr-project
Experiments with Guided Evolutionary Strategies for Behavioral Robotics course project at Innopolis Univeristy
vkurenkov/annealed-salesman
Simulated annealing for traveling salesman problem
vkurenkov/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
vkurenkov/case-212
Открытое письмо специалистов IT-индустрии в защиту фигурантов «московского дела»
vkurenkov/catalyst
Accelerated deep learning R&D
vkurenkov/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
vkurenkov/D4RL
A collection of reference environments for offline reinforcement learning
vkurenkov/DeepRL-Grounding
Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)
vkurenkov/fastapi-basic-mongodb-example
Basic Structure for FastAPI that uses Motor (Async MongoDB Driver)
vkurenkov/gym-minigrid
Minimalistic gridworld package for OpenAI Gym
vkurenkov/ICPy
vkurenkov/lightfm
A Python implementation of LightFM, a hybrid recommendation algorithm.
vkurenkov/muon-detection
vkurenkov/pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
vkurenkov/researcher
A jekyll based resume template
vkurenkov/rlax
vkurenkov/rrc_simulation
Simulation for the Real Robot Challenge (https://real-robot-challenge.com)
vkurenkov/TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
vkurenkov/vkurenkov.github.io