Aditya-Ramesh-10's Stars
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
PWhiddy/PokemonRedExperiments
Playing Pokemon Red with Reinforcement Learning
google-research/arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
facebookresearch/ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
srush/Tensor-Puzzles
Solve puzzles. Improve your PyTorch.
google-deepmind/mctx
Monte Carlo tree search in JAX
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Farama-Foundation/ViZDoom
Reinforcement Learning environments based on the 1993 game Doom
google-deepmind/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
eloialonso/iris
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
ml-jku/baselines-rudder
RUDDER for ATARI games with delayed rewards in OpenAI Baselines package
RajGhugare19/dreamerv2
PyTorch implementation of Dreamer-v2, a visual model-based RL algorithm.
Algomancer/Bayesian-Flow-Networks
A simple implementation of Bayesian Flow Networks (BFN)
koz4k/dni-pytorch
Decoupled Neural Interfaces using Synthetic Gradients for PyTorch
lcswillems/torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
andrewliao11/dni.pytorch
Implementation of Decoupled Neural Interfaces using Synthetic Gradients in PyTorch
facebookresearch/motif
Intrinsic Motivation from Artificial Intelligence Feedback
ayulockin/neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
toshikwa/soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic (SAC).
google-deepmind/dm_hard_eight
facebookresearch/e3b
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
jonathanmli/Avalon-LLM
This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"
jrobine/twm
Transformer-based World Models
facebookresearch/svg
On the model-based stochastic value gradient for continuous reinforcement learning
twni2016/Memory-RL
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
widmi/rudder-a-practical-tutorial
A practical step-by-step guide to applying RUDDER
ml-jku/rudder-demonstration-code
Code for the demonstration example task in the RUDDER blog
samlobel/CFN
Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023
Aditya-Ramesh-10/exploring-through-rcgvf
neuralml/bp_lambda
A TD-like model for learning and using synthetic gradients