Dhawgupta

Pinned Repositories

60_Days_RL_Challenge
Learn Deep Reinforcement Learning in Depth in 60 days
Language:Jupyter Notebook12
ALRS
Automated Lecture Recording System
Language:Python0 1 00
Chord
A Repo for implementation of Chord Protocol using RPC with custom protocol
Language:Python1 1 00
CS382-Project
Visualization of DIfferent Sorting Algorithms
Language:Java1 1 01
DeepRL-Tutorials
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Language:Jupyter Notebook1 1 01
Fuzzy-Control-Inverted-Pendulum
Fuzzy Controller for Inverted Pendulum
Language:Python0 1 01
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python1 1 00
Nature-of-Log
In this notebook I just discuss some of the properties of log around the stationary point of functions and their behaviour around concave functions
Language:Jupyter Notebook11
RLDS
Reinfrocement Learning for Dialouge Stragtegy
Language:Python1 1 00
SC2.0
Smart Containers Implementation code , specifically implementation of code of Hx711 on Intel Edison
Language:C++00

Dhawgupta's Repositories

Dhawgupta/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python1 1 00
Dhawgupta/args
Dhawgupta/cmput412-competition1
Language:CMake3 1
Dhawgupta/controllable_agent
Dhawgupta/dca
Dynamic channel allocation in cellular networks by reinforcement learning
Dhawgupta/Engine
A library for developing and applying Seldonian algorithms
Language:Python
Dhawgupta/gupta2021structural
Code for NeurIPS 2021 Paper: Structural Credit Assignment in Neural Networks using Reinforcement Learning
Language:Python
Dhawgupta/gupta2023behavior
Code for NeurIPS 2023 Spotlight Paper: Behavior Alignment via Reward Function Optimization
Language:Python
Dhawgupta/gupta2024from
Code for AAAI 2024 Oral: From Past to Future: Rethinking Eligibility Traces
Language:Python
Dhawgupta/hrldm
Repository and Code for the Heirarchial Reinforcement Learning based Dialogue Management System
Language:Python1
Dhawgupta/i3-config
My awesome i3 configuration
Dhawgupta/implicit_q_learning
Language:Python0 0
Dhawgupta/JAXSeq
Train very large language models in Jax.
Language:Python
Dhawgupta/learn-julia-the-hard-way
Learn Julia the hard way!
Language:Makefile1 0
Dhawgupta/LLM_RL
Language:Python0 01
Dhawgupta/LMRL-Gym
The public repo for changes
Language:Python
Dhawgupta/muzero-general
MuZero
Language:Python1 0
Dhawgupta/option-4rooms
Language:Python1 0
Dhawgupta/option-baselines
Language:Python
Dhawgupta/option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
Language:Python1 0
Dhawgupta/personal-webpage
Language:Shell2 0
Dhawgupta/pfqi
Dhawgupta/purejaxrl
Really Fast End-to-End Jax RL Implementations
Dhawgupta/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Language:Python
Dhawgupta/quantifying_exposure_bias
Accompanying repository for the paper: Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
Language:Python
Dhawgupta/reading
Language:TeX
Dhawgupta/rl-fl
Language:Python
Dhawgupta/RlGlue
Language:Python1 0
Dhawgupta/VSCodeFiles
json files and description for my VS code installation
2 0
Dhawgupta/website-data