koulanurag

Applied Scientist 2 at Amazon | LLM for Code | Deep Reinforcement Learning

AmazonNew York, New York

Pinned Repositories

conformal
Conformal prediction is a framework for providing accuracy guarantees on the predictions of a base predictor
Language:Python9 5 14
dream-and-search
Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"
Language:Python11 4 01
gym-cartpole-continuous
CartPole env. with continuous action space
Language:Python7 3 01
gym_x
Gym environments for capture properties of hidden states(hx) of recurrent networks.
Language:Python5 4 00
ma-gym
A collection of multi agent environments based on OpenAI gym.
Language:Python579 8 29106
minimal-marl
Minimal implementation of multi-agent reinforcement learning algorithms
Language:Python50 3 412
mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
Language:Python49 5 313
muzero-pytorch
Pytorch Implementation of MuZero
Language:Python345 21 757
variable-td3
Learning n-step actions for control tasks
Language:Python3 5 00
visTorch
Interacting with Latent Space of AutoEncoder
Language:Python21 3 12

koulanurag's Repositories

koulanurag/ma-gym
A collection of multi agent environments based on OpenAI gym.
Language:Python579 8 29106
koulanurag/muzero-pytorch
Pytorch Implementation of MuZero
Language:Python345 21 757
koulanurag/minimal-marl
Minimal implementation of multi-agent reinforcement learning algorithms
Language:Python50 3 412
koulanurag/mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
Language:Python49 5 313
koulanurag/visTorch
Interacting with Latent Space of AutoEncoder
Language:Python21 3 12
koulanurag/dream-and-search
Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"
Language:Python11 4 01
koulanurag/conformal
Conformal prediction is a framework for providing accuracy guarantees on the predictions of a base predictor
Language:Python9 5 14
koulanurag/gym-cartpole-continuous
CartPole env. with continuous action space
Language:Python7 3 01
koulanurag/gym_x
Gym environments for capture properties of hidden states(hx) of recurrent networks.
Language:Python5 4 00
koulanurag/marl-pytorch
Pytorch Implementations of Multi Agent Reinforcement Learning(marl) algorithms
Language:Python5 3 01
koulanurag/deep-conformal
Applying Conformal Prediction over Deep Neural Nets
Language:Python3 3 01
koulanurag/opcc
Benchmark for "Offline Policy Comparison with Confidence"
Language:Python3 1 00
koulanurag/policybazaar
A collection of multi-quality policies for continuous control tasks.
Language:Python3 4 0
koulanurag/variable-td3
Learning n-step actions for control tasks
Language:Python3 5 00
koulanurag/maze-world
Random maze environments with different size and complexity for reinforcement learning research.
Language:Python1 3 0
koulanurag/opcc-baselines
Baselines for "Offline Policy Comparison with Confidence"
Language:Python1 1 0
koulanurag/pfa
Policy Fusion Architecture (PFA): We investigate policy gradient approaches for reward decomposition in reinforcement Learning
Language:Python1 3 0
koulanurag/tensorboard2seaborn
Plot Tensorflow Summary Event in a Beautiful Way 🌈
Language:Python1 2 01
koulanurag/vpn
PyTorch implementation of Value Prediction Network (VPN) :construction: :construction_worker:
Language:Python1 4 0
koulanurag/pid-pendulum
PID controller for open-ai gym's Pendulum.
Language:Jupyter Notebook0 4 00
koulanurag/abp
A library to create adaptive programs (abp) via Reinforcement Learning
Language:Python5 01
koulanurag/bmi
BMI Dashboard using NodeJs
Language:JavaScript3 0
koulanurag/card-arrangement-game
Card Arrangement Game to introduce statistical notions in fun way :game_die: :black_joker: :slot_machine:
Language:CSS3 0
koulanurag/chatter-nodejs
Trying to make a chat channel similar to IRC. (Inspired by usage of slack)
Language:JavaScript3 0
koulanurag/d4rl
A benchmark for offline reinforcement learning.
Language:Python2 0
koulanurag/device-config-app
Primary purpose of app is to configure Echo Sounders.
Language:CSS3 0
koulanurag/gym-sokoban
Sokoban environment for OpenAI Gym
Language:Python1 0
koulanurag/sokoban-bazaar
Language:Python1 0
koulanurag/sweatram_mean
SweatRam's Dashboard using a Mean Stack
Language:HTML3 0
koulanurag/tweet-node
This project is to analyse real time tweets
Language:CSS3 0