zhxei1's Stars
lisarah/mdp_path_coordination
thomashirtz/gym-hybrid
Collection of OpenAI parametrized action-space environments.
LijunSun90/pursuitFSC2
POMG algorithm for large-scale pursuit game with partial observation and no communication.
Tviskaron/pogema-baselines
PPO and PyMARL baselines for the Pogema environment
marmotlab/PRIMAL2
Training code for PRIMAL2 (public repo)
heidekrueger/bnelearn
A Framework for Equilibrium Learning in Sealed-Bid Auctions
StanfordASL/hj_reachability
Hamilton-Jacobi reachability analysis in JAX.
SafeRoboticsLab/ISAACS
montrealrobotics/active-domainrand
Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)
locuslab/convex_adversarial
A method for training neural networks that are provably robust to adversarial attacks.
huanzhang12/CROWN-IBP
Certified defense against adversarial examples using CROWN and IBP. Also includes a GPU implementation of the CROWN verification algorithm (in PyTorch).
Verified-Intelligence/auto_LiRPA
auto_LiRPA: An Automatic Linear Relaxation based Perturbation Analysis Library for Neural Networks and General Computational Graphs
huanzhang12/RecurJac-and-CROWN
Reference implementations for RecurJac, CROWN, FastLin and FastLip (Neural Network verification and robustness certification algorithms) [Do not use this repo, use https://github.com/Verified-Intelligence/auto_LiRPA instead]
yjhuangcd/FI-ODE
Official implementation for FI-ODE: Certified and Robust Forward Invariance in Neural ODEs.
liuzuxin/safe-rl-robustness
Code for "On the Robustness of Safe Reinforcement Learning under Observational Perturbations" (ICLR 2023)
Hadisalman/robust-verify-benchmark
Benchmark for LP-relaxed robustness verification of ReLU networks
cduan2020/LocalizedControl
Controllability analysis, driver placement, and optimal control design on large-scale dynamical networks using the concept of information neighborhood.
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
umd-huang-lab/WocaR-RL
Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning
RobustFieldAutonomyLab/Stochastic_Road_Network
[UR 2023] Robust Route Planning with Distributional Reinforcement Learning in a Stochastic Road Network Environment
ythuangyt/Robust-Reinforcement-Learning-via-Adversarial-training-with-Langevin-Dynamics
Author implementation of the paper "Robust Reinforcement Learning via Adversarial training with Langevin Dynamics"
vwxyzjn/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
zaiyan-x/RFQI
Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]
tesslerc/ActionRobustRL
Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.09184
nsidn98/Robust-Reinforcement-Learning
Reinforcement Learning CS6700 Course Capstone Project
huanzhang12/SA_PPO
[NeurIPS 2020 Spotlight] State-adversarial PPO for robust deep reinforcement learning
chenhongge/StateAdvDRL
[NeurIPS 2020 Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"
huanzhang12/SA_DDPG
[NeurIPS 2020 Spotlight] State-adversarial DDPG for robust deep reinforcement learning
nolanwagener/safe_rl
Implementations of SAILR, PDO, and CSC
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.