Pinned Repositories
cite-with-js
LaTeX like citations in HTML with JS
cl_rl_experiments
Continual reinforcement learning experiments in Pytorch
cl_safer_classifiers
Code to accompany SafeAI-20 paper "Simple Continual Learning Strategies for Safer Classifiers"
dockerfiles
Public docker files
ICL
Official Code for ICLR 2023 paper "Learning Soft Constraints from Constrained Expert Demonstrations"
naclports
my ports into native client
nmdc
neo modus direct connect's implementation for python2
nodewiki
a simple wiki in node.js, https://youtu.be/X4XOIAoYpvA
rl.code
rl implementations
stability-biases
Code for Master's Thesis
ashishgaurav13's Repositories
ashishgaurav13/cl_rl_experiments
Continual reinforcement learning experiments in Pytorch
ashishgaurav13/cl_safer_classifiers
Code to accompany SafeAI-20 paper "Simple Continual Learning Strategies for Safer Classifiers"
ashishgaurav13/dockerfiles
Public docker files
ashishgaurav13/rl.code
rl implementations
ashishgaurav13/stability-biases
Code for Master's Thesis
ashishgaurav13/wm2
functional wise-move
ashishgaurav13/adv-bnn
ashishgaurav13/algos
machine learning algorithms
ashishgaurav13/capsulenets
trying capsulenets on fashion mnist, borrowed from https://github.com/XifengGuo/CapsNet-Keras
ashishgaurav13/car_racing
Solution for CarRacing-v0
ashishgaurav13/CausalDiscovery
causal discovery in pyro
ashishgaurav13/cite-with-js
LaTeX like citations in HTML with JS
ashishgaurav13/CS886
Causal Inference in Machine Learning
ashishgaurav13/football
Check out the new game server:
ashishgaurav13/gcastle
trustworthy AI related projects
ashishgaurav13/GridDriving
GridDriving: driving simulator based off CarRacing-v0
ashishgaurav13/gym
A toolkit for developing and comparing reinforcement learning algorithms.
ashishgaurav13/ICL
Official Code for ICLR 2023 paper "Learning Soft Constraints from Constrained Expert Demonstrations"
ashishgaurav13/keras-rl
Deep Reinforcement Learning for Keras.
ashishgaurav13/Learning-Pyro
ashishgaurav13/mcts-tic-tac-toe
Monte Carlo Tree Search for tic tac toe
ashishgaurav13/minerl-baselines
A collection of baselines for the MineRL environment/datasets & the NeurIPS 2019 MineRL competition
ashishgaurav13/pixel-cnn
Code for the paper "PixelCNN++: A PixelCNN Implementation with Discretized Logistic Mixture Likelihood and Other Modifications"
ashishgaurav13/pixelcnn
Pytorch Implementation of OpenAI's PixelCNN++
ashishgaurav13/pong-dqn
ashishgaurav13/PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
ashishgaurav13/python-genbadge
A library to generate badges for typical checks (flake8, pytest, coverage, etc.)
ashishgaurav13/snake-dqn-minimal
minimal dqn on snake using tfjs
ashishgaurav13/tensorboard-aggregator
Aggregate multiple tensorboard runs to new summary or csv files
ashishgaurav13/torch_rl
RL starter files in order to immediatly train, visualize and evaluate an agent without writing any line of code