Aditya-Ramesh-10

Swiss AI Lab IDSIA, Lugano, Switzerland

Aditya-Ramesh-10's Stars

google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Language:Jupyter Notebook7.8k 103 1.5k782
PWhiddy/PokemonRedExperiments
Playing Pokemon Red with Reinforcement Learning
Language:Jupyter Notebook6.8k 69 113619
google-research/arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Language:Python5.2k 32 52326
facebookresearch/ReAgent
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
Language:Python3.6k 148 109516
srush/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
Language:Jupyter Notebook3.1k 12 20252
google-deepmind/mctx
Monte Carlo tree search in JAX
Language:Python2.3k 28 47187
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language:Python2.1k 39 188603
Farama-Foundation/ViZDoom
Reinforcement Learning environments based on the 1993 game Doom
Language:C++1.7k 50 464397
google-deepmind/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
Language:Python1.5k 60 31181
eloialonso/iris
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
Language:Python788 23 2376
ml-jku/baselines-rudder
RUDDER for ATARI games with delayed rewards in OpenAI Baselines package
Language:Python265 16 040
RajGhugare19/dreamerv2
PyTorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.
Language:Python237 3 440
Algomancer/Bayesian-Flow-Networks
A simple implementation of Bayesian Flow Networks (BFN)
Language:Jupyter Notebook236 8 515
koz4k/dni-pytorch
Decoupled Neural Interfaces using Synthetic Gradients for PyTorch
Language:Python236 10 438
lcswillems/torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Language:Python191 8 665
andrewliao11/dni.pytorch
Implement Decoupled Neural Interfaces using Synthetic Gradients in PyTorch
Language:Python117 9 642
facebookresearch/motif
Intrinsic Motivation from Artificial Intelligence Feedback
Language:Python117 6 313
ayulockin/neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
Language:Python115 4 1142
toshikwa/soft-actor-critic.pytorch
PyTorch implementation of Soft Actor-Critic (SAC).
Language:Python94 3 222
google-deepmind/dm_hard_eight
Language:Python85 8 34
facebookresearch/e3b
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
Language:Python78 9 313
jonathanmli/Avalon-LLM
This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"
Language:Python69 3 26
jrobine/twm
Transformer-based World Models
Language:Python66 5 48
facebookresearch/svg
On the model-based stochastic value gradient for continuous reinforcement learning
Language:Jupyter Notebook54 6 211
twni2016/Memory-RL
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
Language:Python49 2 35
widmi/rudder-a-practical-tutorial
A practical step-by-step guide to applying RUDDER
Language:Jupyter Notebook33 4 014
ml-jku/rudder-demonstration-code
Code for demonstration example-task in RUDDER blog
Language:Python21 5 111
samlobel/CFN
Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023
Language:Python16 2 33
Aditya-Ramesh-10/exploring-through-rcgvf
Language:Python3 1 10
neuralml/bp_lambda
A TD-like model for learning and using synthetic gradients
Language:Python2 1 01
