Pinned Repositories
60_Days_RL_Challenge
Learn Deep Reinforcement Learning in Depth in 60 days
ALRS
Automated Lecture Recording System
Chord
A Repo for implementation of Chord Protocol using RPC with custom protocol
CS382-Project
Visualization of DIfferent Sorting Algorithms
DeepRL-Tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Fuzzy-Control-Inverted-Pendulum
Fuzzy Controller for Inverted Pendulum
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Nature-of-Log
In this notebook I just discuss some of the properties of log around the stationary point of functions and their behaviour around concave functions
RLDS
Reinfrocement Learning for Dialouge Stragtegy
SC2.0
Smart Containers Implementation code , specifically implementation of code of Hx711 on Intel Edison
Dhawgupta's Repositories
Dhawgupta/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Dhawgupta/args
Dhawgupta/cmput412-competition1
Dhawgupta/controllable_agent
Dhawgupta/dca
Dynamic channel allocation in cellular networks by reinforcement learning
Dhawgupta/Engine
A library for developing and applying Seldonian algorithms
Dhawgupta/gupta2021structural
Code for NeurIPS 2021 Paper: Structural Credit Assignment in Neural Networks using Reinforcement Learning
Dhawgupta/gupta2023behavior
Code for NeurIPS 2023 Spotlight Paper: Behavior Alignment via Reward Function Optimization
Dhawgupta/gupta2024from
Code for AAAI 2024 Oral: From Past to Future: Rethinking Eligibility Traces
Dhawgupta/hrldm
Repository and Code for the Heirarchial Reinforcement Learning based Dialogue Management System
Dhawgupta/i3-config
My awesome i3 configuration
Dhawgupta/implicit_q_learning
Dhawgupta/JAXSeq
Train very large language models in Jax.
Dhawgupta/learn-julia-the-hard-way
Learn Julia the hard way!
Dhawgupta/LLM_RL
Dhawgupta/LMRL-Gym
The public repo for changes
Dhawgupta/muzero-general
MuZero
Dhawgupta/option-4rooms
Dhawgupta/option-baselines
Dhawgupta/option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
Dhawgupta/personal-webpage
Dhawgupta/pfqi
Dhawgupta/purejaxrl
Really Fast End-to-End Jax RL Implementations
Dhawgupta/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Dhawgupta/quantifying_exposure_bias
Accompanying repository for the paper: Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
Dhawgupta/reading
Dhawgupta/rl-fl
Dhawgupta/RlGlue
Dhawgupta/VSCodeFiles
json files and description for my VS code installation
Dhawgupta/website-data