gkswamy98

PhD Student @ CMU RI. MS from Berkeley. Summers @google, @MicrosoftResearch, @aurora-opensource, @NVIDIA, and @ SpaceX.

Pinned Repositories

cmu-10732-robustness-adaptivity-shift
Official repository for CMU Machine Learning Department's 10732: Robustness and Adaptivity in Shifting Environments
73 12 12
jaxirl
Contains JAX implementations of algorithms for inverse reinforcement learning.
Language: Python 69 6 21
causal_il
Contains implementations of the DoubIL and ResiduIL algorithms from the ICML '22 paper "Causal Imitation Learning under Temporally Correlated Noise."
Language: Jupyter Notebook 10 2 00
dotfiles
some of my configs
Language: Emacs Lisp 9 2 00
fast_irl
Contains an implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.
Language: Jupyter Notebook 49 4 15
imessage-spam-detection
Language: Swift 36 3 07
pillbox
Contains implementations of the AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 paper "Of Moments and Matching."
Language: Jupyter Notebook 21 2 04
sequence_model_il
Contains sequence-model implementations of on- and off-policy imitation learning algorithms for problems with unobserved contexts.
Language: Jupyter Notebook 5 2 00
valuedice
Fork of the ValueDICE code that supports discrete action spaces and pybullet and is truly off-policy.
Language: Python 4 2 00
garage
⚡️ Shockingly fast imitation learning algorithms via combining online and offline data engines. ⚡️
Language: Python 92

gkswamy98's Repositories

gkswamy98/fast_irl
Contains an implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.
Language: Jupyter Notebook 49 4 15
gkswamy98/pillbox
Contains implementations of the AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 paper "Of Moments and Matching."
Language: Jupyter Notebook 21 2 04
gkswamy98/causal_il
Contains implementations of the DoubIL and ResiduIL algorithms from the ICML '22 paper "Causal Imitation Learning under Temporally Correlated Noise."
Language: Jupyter Notebook 10 2 00
gkswamy98/dotfiles
some of my configs
Language: Emacs Lisp 9 2 00
gkswamy98/sequence_model_il
Contains sequence-model implementations of on- and off-policy imitation learning algorithms for problems with unobserved contexts.
Language: Jupyter Notebook 5 2 00
gkswamy98/valuedice
Fork of the ValueDICE code that supports discrete action spaces and pybullet and is truly off-policy.
Language: Python 4 2 00
gkswamy98/meta-rl-bci
Meta-learning + MaxEnt deep RL for shared autonomy from EEG signals
Language: Python 3 3 00
gkswamy98/replay_est
Contains an implementation of the replay estimation algorithm from "Minimax Optimal Online Imitation Learning via Replay Estimation."
Language: Python 3 3 01
gkswamy98/adversarial_rl
CS 294-131 Project, reimplementing https://arxiv.org/pdf/1702.02284.pdf.
Language: Python 1 4 01
gkswamy98/ViZDoom
Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information.
Language: C++ 1 2 01
gkswamy98/icl
Language: HTML 0 1 00
gkswamy98/autosort
Human-assisted few-shot object sorting
3 0
gkswamy98/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language: Python 2 0
gkswamy98/causil
Language: HTML 2 0
gkswamy98/D4RL
A collection of reference environments for offline reinforcement learning
Language: Python 0 0
gkswamy98/DQfD
An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games
Language: Python 0 0
gkswamy98/filter
Language: HTML 1 0
gkswamy98/gym-minigrid
Minimalistic gridworld package for OpenAI Gym
Language: Python 2 0
gkswamy98/hyper
Language: HTML 1 0
gkswamy98/il_envs
Language: Jupyter Notebook 2 0
gkswamy98/jaco_learning
Control, planning, and learning system for human-robot interaction with a JACO2 7DOF robotic arm.
Language: OpenEdge ABL 1 0
gkswamy98/mjrl
Reinforcement learning algorithms for MuJoCo tasks
Language: Python 1 0
gkswamy98/mmil
Website for the ICML '21 paper.
Language: HTML 2 0
gkswamy98/mu4e-dashboard
A dashboard for mu4e (mu for emacs)
Language: Emacs Lisp 1 0
gkswamy98/replay
Language: HTML 1 0
gkswamy98/sequil
Language: HTML 1 0
gkswamy98/serializable
Utilities for creating serializable classes
Language: Python 2 0
gkswamy98/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Language: Python 2 0
gkswamy98/spo
Language: HTML 1 0
gkswamy98/valentinp.github.com
My public page.
Language: CSS 1 01