vkurenkov

a reinforcement learning orc

@tinkoff-aiKazan

Pinned Repositories

ad-eps
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
Language:Python18 2 00
CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language:Python408 3 1015
headless-ad
Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"
Language:Python20 4 00
xland-minigrid
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
Language:Python153 7 1211
ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
Language:Jupyter Notebook50 2 06
sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
Language:Python47 3 04
guided-es-by-differentiable-simulators
Guiding evolutionary strategies by (inaccurate) differentiable robot simulators @ NeurIPS, 4th Robot Learning Workshop
Language:Python6 2 01
haxball-chameleon
Solving Haxball (www.haxball.com) using Imitation Learning methods.
Language:JavaScript22 5 25
hierarchical-skill-acquisition
Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong, and Richard Socher
Language:Python11 2 10
language-grounding-multigoal
An accompanying code and experiments' results for Task-Oriented Language Grounding for Language Input with Multiple Sub-Goals of Non-Linear Order
Language:Jupyter Notebook8 4 01

vkurenkov's Repositories

vkurenkov/haxball-chameleon
Solving Haxball (www.haxball.com) using Imitation Learning methods.
Language:JavaScript22 5 25
vkurenkov/hierarchical-skill-acquisition
Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong, and Richard Socher
Language:Python11 2 10
vkurenkov/language-grounding-multigoal
An accompanying code and experiments' results for Task-Oriented Language Grounding for Language Input with Multiple Sub-Goals of Non-Linear Order
Language:Jupyter Notebook8 4 01
vkurenkov/guided-es-by-differentiable-simulators
Guiding evolutionary strategies by (inaccurate) differentiable robot simulators @ NeurIPS, 4th Robot Learning Workshop
Language:Python6 2 01
vkurenkov/cem-tetris
Solving Tetris using Cross-Entropy Method
Language:Haskell4 2 00
vkurenkov/graphy-db
Graph Database
Language:C#4 3 13
vkurenkov/tensegrity
Language:Python4 0 00
vkurenkov/bcr-project
Experiments with Guided Evolutionary Strategies for Behavioral Robotics course project at Innopolis Univeristy
Language:Python3 3 01
vkurenkov/annealed-salesman
Simulated annealing for traveling salesman problem
Language:Jupyter Notebook1 0
vkurenkov/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
0 0
vkurenkov/case-212
Открытое письмо специалистов IT-индустрии в защиту фигурантов «московского дела»
Language:Python0 0
vkurenkov/catalyst
Accelerated deep learning R&D
Language:Python0 0
vkurenkov/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python
vkurenkov/D4RL
A collection of reference environments for offline reinforcement learning
Language:Python0 0
vkurenkov/DeepRL-Grounding
Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)
Language:Python2 0
vkurenkov/fastapi-basic-mongodb-example
Basic Structure for FastAPI that uses Motor (Async MongoDB Driver)
Language:Python0 0
vkurenkov/gym-minigrid
Minimalistic gridworld package for OpenAI Gym
Language:Python0 0
vkurenkov/ICPy
Language:Python0 0
vkurenkov/lightfm
A Python implementation of LightFM, a hybrid recommendation algorithm.
Language:Python0 0
vkurenkov/muon-detection
Language:Jupyter Notebook2 0
vkurenkov/pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
Language:Python0 0
vkurenkov/researcher
A jekyll based resume template
Language:HTML0 0
vkurenkov/rlax
Language:Python0 0
vkurenkov/rrc_simulation
Simulation for the Real Robot Challenge (https://real-robot-challenge.com)
Language:Python0 0
vkurenkov/TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
Language:Python0 0
vkurenkov/vkurenkov.github.io
Language:HTML1 0

vkurenkov

Pinned Repositories

ad-eps

CORL

headless-ad

xland-minigrid

ReBRAC

sac-rnd

guided-es-by-differentiable-simulators

haxball-chameleon

hierarchical-skill-acquisition

language-grounding-multigoal

vkurenkov's Repositories

vkurenkov/haxball-chameleon

vkurenkov/hierarchical-skill-acquisition

vkurenkov/language-grounding-multigoal

vkurenkov/guided-es-by-differentiable-simulators

vkurenkov/cem-tetris

vkurenkov/graphy-db

vkurenkov/tensegrity

vkurenkov/bcr-project

vkurenkov/annealed-salesman

vkurenkov/awesome-offline-rl

vkurenkov/case-212

vkurenkov/catalyst

vkurenkov/cleanrl

vkurenkov/D4RL

vkurenkov/DeepRL-Grounding

vkurenkov/fastapi-basic-mongodb-example

vkurenkov/gym-minigrid

vkurenkov/ICPy

vkurenkov/lightfm

vkurenkov/muon-detection

vkurenkov/pytorch-a3c

vkurenkov/researcher

vkurenkov/rlax

vkurenkov/rrc_simulation

vkurenkov/TD3_BC

vkurenkov/vkurenkov.github.io