Pinned Repositories
60_Days_RL_Challenge
Learn Deep Reinforcement Learning in Depth in 60 days
ALRS
Automated Lecture Recording System
Chord
A Repo for implementation of Chord Protocol using RPC with custom protocol
choudhary2024icu
Repository which contains implementation of baselines algorithms including PPO, DQN and SAC for the ICU Sepsis benchmark (https://github.com/icu-sepsis/icu-sepsis), introduced in "ICU-Sepsis: A Benchmark MDP Built from Real Medical Data", accepted in RLC 2024.
CS382-Project
Visualization of DIfferent Sorting Algorithms
DeepRL-Tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Fuzzy-Control-Inverted-Pendulum
Fuzzy Controller for Inverted Pendulum
gupta2021structural
Code for NeurIPS 2021 Paper: Structural Credit Assignment in Neural Networks using Reinforcement Learning
RLDS
Reinfrocement Learning for Dialouge Stragtegy
SC2.0
Smart Containers Implementation code , specifically implementation of code of Hx711 on Intel Edison
Dhawgupta's Repositories
Dhawgupta/choudhary2024icu
Repository which contains implementation of baselines algorithms including PPO, DQN and SAC for the ICU Sepsis benchmark (https://github.com/icu-sepsis/icu-sepsis), introduced in "ICU-Sepsis: A Benchmark MDP Built from Real Medical Data", accepted in RLC 2024.
Dhawgupta/gupta2021structural
Code for NeurIPS 2021 Paper: Structural Credit Assignment in Neural Networks using Reinforcement Learning
Dhawgupta/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Dhawgupta/args
Dhawgupta/atari-prediction-benchmark
Dhawgupta/cmput412-competition1
Dhawgupta/controllable_agent
Dhawgupta/dca
Dynamic channel allocation in cellular networks by reinforcement learning
Dhawgupta/Engine
A library for developing and applying Seldonian algorithms
Dhawgupta/gupta2023behavior
Code for NeurIPS 2023 Spotlight Paper: Behavior Alignment via Reward Function Optimization
Dhawgupta/gupta2024from
Code for AAAI 2024 Oral: From Past to Future: Rethinking Eligibility Traces
Dhawgupta/hrldm
Repository and Code for the Heirarchial Reinforcement Learning based Dialogue Management System
Dhawgupta/i3-config
My awesome i3 configuration
Dhawgupta/implicit_q_learning
Dhawgupta/JaxGCRL
Dhawgupta/JAXSeq
Train very large language models in Jax.
Dhawgupta/learn-julia-the-hard-way
Learn Julia the hard way!
Dhawgupta/LLM_RL
Dhawgupta/LMRL-Gym
The public repo for changes
Dhawgupta/option-4rooms
Dhawgupta/option-baselines
Dhawgupta/option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
Dhawgupta/purejaxrl
Really Fast End-to-End Jax RL Implementations
Dhawgupta/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Dhawgupta/quantifying_exposure_bias
Accompanying repository for the paper: Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
Dhawgupta/reading
Dhawgupta/rl-fl
Dhawgupta/RlGlue
Dhawgupta/Stoix
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
Dhawgupta/website-data