Pinned Repositories
optimized_dp
Optimizing Dynamic Programming-Based Algorithms
atu3
avoiding-the-unrechable
ciff
Cornell Instruction Following Framework
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features
cogail
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
cpo
Constrained Policy Optimization
cpo-pytorch
An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch
hj-dqn-jax
practical-pg
sudo-michael's Repositories
sudo-michael/hj-dqn-jax
sudo-michael/atu3
sudo-michael/avoiding-the-unrechable
sudo-michael/ciff
Cornell Instruction Following Framework
sudo-michael/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features
sudo-michael/cogail
Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration
sudo-michael/cpo-pytorch
An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch
sudo-michael/dmcgym
sudo-michael/practical-pg
sudo-michael/event-jekyll-theme
Jekyll Theme package for your event
sudo-michael/gail-airl-ppo.pytorch
A PyTorch implementation of GAIL and AIRL based on PPO.
sudo-michael/hazard-world-grid
sudo-michael/helperOC
sudo-michael/neuralcompression.github.io
sudo-michael/omnisafe
OmniSafe is an infrastructural framework for accelerating SafeRL research.
sudo-michael/optimized_dp
Optimizing Dynamic Programming-Based Algorithms
sudo-michael/proactive_interventions
Codebase for NeurIPS 2022 paper, "When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning"
sudo-michael/pytorch-soft-actor-critic
PyTorch implementation of soft actor critic
sudo-michael/recovery-rl
Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.
sudo-michael/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
sudo-michael/rl-starter-files
RL starter files in order to immediatly train, visualize and evaluate an agent without writing any line of code
sudo-michael/robopianist
🎹 🤖 A benchmark for high-dimensional robot control.
sudo-michael/safe-control-gym
PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and reinforcement learning
sudo-michael/Safe-MBPO
Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"
sudo-michael/safety-gym
Tools for accelerating safe exploration research.
sudo-michael/safety-gymnasium
Safety-Gymnaisum is a highly scalable and customizable safe reinforcement learning environment library.
sudo-michael/safety_rl
sudo-michael/sbx
SBX: Stable Baselines Jax (SB3 + Jax)
sudo-michael/siren-jax
Unofficial implementation of Siren with Jax for image representation.
sudo-michael/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.