sudo-michael

phd student @ sfu

Vancouver, BC

Pinned Repositories

optimized_dp
Optimizing Dynamic Programming-Based Algorithms
Language:Python107 7 635
atu3
Language:Python0 1 00
avoiding-the-unrechable
Language:MATLAB0 1 00
ciff
Cornell Instruction Following Framework
Language:Python0 0 00
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features
Language:Python0 0 00
cogail
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
Language:Python0 0 00
cpo
Constrained Policy Optimization
Language:Python0 0 00
cpo-pytorch
An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch
Language:Python0 0 00
hj-dqn-jax
Language:Python1 1 00
practical-pg
Language:Jupyter Notebook0 1 02

sudo-michael's Repositories

sudo-michael/hj-dqn-jax
Language:Python1 1 00
sudo-michael/atu3
Language:Python0 1 00
sudo-michael/avoiding-the-unrechable
Language:MATLAB0 1 00
sudo-michael/ciff
Cornell Instruction Following Framework
Language:Python0 0 00
sudo-michael/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features
Language:Python0 0 00
sudo-michael/cogail
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
Language:Python0 0 00
sudo-michael/cpo-pytorch
An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch
Language:Python0 0 00
sudo-michael/dmcgym
Language:Python0 0 00
sudo-michael/practical-pg
Language:Jupyter Notebook0 1 02
sudo-michael/event-jekyll-theme
Jekyll Theme package for your event
Language:HTML0 0
sudo-michael/gail-airl-ppo.pytorch
A PyTorch implementation of GAIL and AIRL based on PPO.
Language:Python0 0
sudo-michael/hazard-world-grid
Language:Python0 0
sudo-michael/helperOC
Language:MATLAB0 0
sudo-michael/neuralcompression.github.io
Language:SCSS0 0
sudo-michael/omnisafe
OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language:Python0 0
sudo-michael/optimized_dp
Optimizing Dynamic Programming-Based Algorithms
Language:Python0 0
sudo-michael/proactive_interventions
Codebase for NeurIPS 2022 paper, "When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning"
Language:Python0 0
sudo-michael/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
Language:Python0 0
sudo-michael/recovery-rl
Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.
Language:Python0 0
sudo-michael/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Language:Python0 0
sudo-michael/rl-starter-files
RL starter files in order to immediatly train, visualize and evaluate an agent without writing any line of code
Language:Python0 0
sudo-michael/robopianist
🎹 🤖 A benchmark for high-dimensional robot control.
Language:Python0 0
sudo-michael/safe-control-gym
PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and reinforcement learning
Language:Python0 0
sudo-michael/Safe-MBPO
Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"
Language:Python0 0
sudo-michael/safety-gym
Tools for accelerating safe exploration research.
Language:Python0 01
sudo-michael/safety-gymnasium
Safety-Gymnaisum is a highly scalable and customizable safe reinforcement learning environment library.
Language:Python0 0
sudo-michael/safety_rl
Language:Python0 0
sudo-michael/sbx
SBX: Stable Baselines Jax (SB3 + Jax)
Language:Python0 0
sudo-michael/siren-jax
Unofficial implementation of Siren with Jax for image representation.
Language:Python0 0
sudo-michael/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python0 0