evgenii-nikishin's Stars
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
timgaripov/swa
Stochastic Weight Averaging in PyTorch
openai/large-scale-curiosity
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
google-research/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
deepmind/chex
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
lean-dojo/LeanDojo
Tool for data extraction and interacting with Lean programmatically.
vict0rsch/PaperMemory
Your browser's reference manager: automatic paper detection (Arxiv, OpenReview & more), publication venue matching and code repository discovery! Also enhances ArXiv: BibTex citation, Markdown link, direct download and more!
openai/coinrun
Code for the paper "Quantifying Transfer in Reinforcement Learning"
magenta/midi-ddsp
Synthesis of MIDI with DDSP (https://midi-ddsp.github.io/)
timgaripov/dnn-mode-connectivity
Mode Connectivity and Fast Geometric Ensembles in PyTorch
ikostrikov/implicit_q_learning
lean-dojo/ReProver
Retrieval-Augmented Theorem Provers for Lean
google/trajax
princeton-nlp/intercode
[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898
waterhorse1/ChessGPT
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
SamsungLabs/tqc_pytorch
Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/
gehring/fax
LauraRuis/groundedSCAN
Grounded SCAN data set.
nikihowe/myriad
Myriad is a real-world testbed that aims to bridge trajectory optimization and deep learning.
tristandeleu/jax-comln
Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)
tristandeleu/jax-meta-learning
A collection of meta-learning algorithms in Jax
proceduralia/high_replay_ratio_continuous_control
Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"
ShangyuanTong/PairGAN