SSubhnil

PhD candidate at Trinity College Dublin, Ireland. I work on RL, causality, latent variables and multi-agent systems.

Trinity College DublinDublin, Ireland

Pinned Repositories

BAC-DAC-gym
Bayesian Actor-Critic with Neural Networks. Developing an OpenAI Gym toolkit for Bayesian AC reinforcement learning.
Language:Python6 2 01
Causal-Gridworld
Testing the causal implications of the wind in the gridworld environment. The wind is the confounder.
Language:Python00
CausalBench
Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
Language:Python0 1 00
CausalCuriosity-test
Causal Curiosity fork for testing in confounded environments.
Language:Python0 0 00
CausalTransformer_exp
Causal Transformer modification for MBRL
Language:Python0 0 00
CDL-bench
Benchmarking CDL in confounded MDP and POMDP settings
Language:Python0 0 00
CoGen_Benchmarking
Benchmarking existing RL algorithms including model-free and model-based approaches on confounded versions of popular environments. Tests generalization and sample efficiency.
Language:Python1 1 00
RacingCARLA
Learning Model Predictive Control (LMPC) for autonomous racing in CARLA 3D environment.
Language:Python22 2 07
RacingLMPC
Language:Python1 2 01
Vehicle-Dynamics-Toolkit
Some advanced tools for race car design - Steady state and transient dynamics, Tyre Data synthesis
Language:MATLAB1 2 01

SSubhnil's Repositories

SSubhnil/RacingCARLA
Learning Model Predictive Control (LMPC) for autonomous racing in CARLA 3D environment.
Language:Python22 2 07
SSubhnil/BAC-DAC-gym
Bayesian Actor-Critic with Neural Networks. Developing an OpenAI Gym toolkit for Bayesian AC reinforcement learning.
Language:Python6 2 01
SSubhnil/CoGen_Benchmarking
Benchmarking existing RL algorithms including model-free and model-based approaches on confounded versions of popular environments. Tests generalization and sample efficiency.
Language:Python1 1 00
SSubhnil/RacingLMPC
Language:Python1 2 01
SSubhnil/Vehicle-Dynamics-Toolkit
Some advanced tools for race car design - Steady state and transient dynamics, Tyre Data synthesis
Language:MATLAB1 2 01
SSubhnil/Causal-Gridworld
Testing the causal implications of the wind in the gridworld environment. The wind is the confounder.
Language:Python00
SSubhnil/CausalBench
Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
Language:Python0 1 00
SSubhnil/CausalCuriosity-test
Causal Curiosity fork for testing in confounded environments.
Language:Python0 0 00
SSubhnil/CausalTransformer_exp
Causal Transformer modification for MBRL
Language:Python0 0 00
SSubhnil/CDL-bench
Benchmarking CDL in confounded MDP and POMDP settings
Language:Python0 0 00
SSubhnil/D4PG-bench
Benchmarking D4PG in confounded environements.
Language:Python0 0 00
SSubhnil/dreamerv3-benchmod
Modifying DreamerV3 for benchmarking in confounded environments
Language:Python0 1 00
SSubhnil/mamba-test
Meta-RL Model-Based Algorithm - Confounding tests
Language:Python00
SSubhnil/dreamer-new
Updated version of DreamerV3 cloned from danijar/dreamerv3
Language:Python1 0
SSubhnil/DT-B
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
SSubhnil/dv3-torch
Benchmarking DreamerV3 with Plan2Explore.
Language:Python0 0
SSubhnil/FCD-bench
Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)
Language:Python0 0
SSubhnil/GRADER-bench
Repository for benchmarking GRADER in confounded environments for zero and few-shot generalization.
Language:Python
SSubhnil/mocoda-b
Testing MoCoDA in DM Control Suite and confounded environments.
Language:Python0 0
SSubhnil/mpo-bench
Baseline tests on MPO with unobserved confounders
Language:Python0 0
SSubhnil/MWM-bench
Benchmarking MWM in confounded environments
Language:Python0 0
SSubhnil/P2P-bench
Code accompanying paper "Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning".
Language:Python0 0
SSubhnil/RIA-bench
Benchmarking RIA in confounded environments for zero and few-shot generalization. Now compatible with TF2.
Language:Python0 0
SSubhnil/RIA_base
RIA base version. With new Walker environment similar to DM Control Suite physics and reward function.
Language:Python1 0
SSubhnil/rl2-bench
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'
SSubhnil/sac-bench
PyTorch implementation of Soft Actor-Critic (SAC) for Unobserved Confounders
Language:Jupyter Notebook0 0
SSubhnil/SAC_dmc
SAC implementation for 3D visualization of state transitions
Language:Python
SSubhnil/STORM-mod
Modifying STORM transformer for Causal Transformer
Language:Python0 0
SSubhnil/TMCL-b
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
SSubhnil/twm-mod
TWM modification