evgenii-nikishin

Mila, Université de MontréalMontreal, QC, Canada

evgenii-nikishin's Stars

MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
Language:JavaScript5.5k 63 146508
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python5.3k 36 182610
timgaripov/swa
Stochastic Weight Averaging in PyTorch
Language:Python960 18 23129
openai/large-scale-curiosity
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
Language:Python802 61 20180
google-research/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
Language:Jupyter Notebook747 11 1746
deepmind/chex
Language:Python614 17 3534
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language:Jupyter Notebook608 12 865
lean-dojo/LeanDojo
Tool for data extraction and interacting with Lean programmatically.
Language:Python544 13 6183
vict0rsch/PaperMemory
Your browser's reference manager: automatic paper detection (Arxiv, OpenReview & more), publication venue matching and code repository discovery! Also enhances ArXiv: BibTex citation, Markdown link, direct download and more!
Language:JavaScript490 7 7617
openai/coinrun
Code for the paper "Quantifying Transfer in Reinforcement Learning"
Language:C++389 154 3285
magenta/midi-ddsp
Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)
Language:Python300 11 1617
timgaripov/dnn-mode-connectivity
Mode Connectivity and Fast Geometric Ensembles in PyTorch
Language:Python263 13 1043
ikostrikov/implicit_q_learning
Language:Python226 5 938
lean-dojo/ReProver
Retrieval-Augmented Theorem Provers for Lean
Language:Python209 9 2047
google/trajax
Language:Python194 8 1021
princeton-nlp/intercode
[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898
Language:Python182 7 1731
waterhorse1/ChessGPT
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
Language:Python95 4 47
SamsungLabs/tqc_pytorch
Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/
Language:Python90 11 616
gehring/fax
Language:Python78 9 199
LauraRuis/groundedSCAN
Grounded SCAN data set.
Language:Python69 8 112
nikihowe/myriad
Myriad is a real-world testbed that aims to bridge trajectory optimization and deep learning.
Language:Python59 2 03
tristandeleu/jax-comln
Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)
Language:Python27 3 03
tristandeleu/jax-meta-learning
A collection of meta-learning algorithms in Jax
Language:Python23 4 13
proceduralia/high_replay_ratio_continuous_control
Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"
Language:Python22 2 04
ShangyuanTong/PairGAN
Language:Jupyter Notebook213

evgenii-nikishin

evgenii-nikishin's Stars

MineDojo/Voyager

vwxyzjn/cleanrl

timgaripov/swa

openai/large-scale-curiosity

google-research/rliable

deepmind/chex

ikostrikov/jaxrl

lean-dojo/LeanDojo

vict0rsch/PaperMemory

openai/coinrun

magenta/midi-ddsp

timgaripov/dnn-mode-connectivity

ikostrikov/implicit_q_learning

lean-dojo/ReProver

google/trajax

princeton-nlp/intercode

waterhorse1/ChessGPT

SamsungLabs/tqc_pytorch

gehring/fax

LauraRuis/groundedSCAN

nikihowe/myriad

tristandeleu/jax-comln

tristandeleu/jax-meta-learning

proceduralia/high_replay_ratio_continuous_control

ShangyuanTong/PairGAN