zhxei1

zhxei1's Stars

lisarah/mdp_path_coordination
Language:Python2
thomashirtz/gym-hybrid
Collection of OpenAI parametrized action-space environments.
Language:Python559
LijunSun90/pursuitFSC2
POMG algorithm for large-scale pursuit game with partial observation and no communication.
Language:Python182
Tviskaron/pogema-baselines
PPO and PyMARL baseline for Pogema environment
Language:Python205
marmotlab/PRIMAL2
Training code PRIMAL2 - Public Repo
Language:Python14657
heidekrueger/bnelearn
A Framework for Equilibrium Learning in Sealed-Bid Auctions
Language:Jupyter Notebook222
StanfordASL/hj_reachability
Hamilton-Jacobi reachability analysis in JAX.
Language:Python9716
SafeRoboticsLab/ISAACS
Language:Python92
montrealrobotics/active-domainrand
Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)
Language:Python9419
locuslab/convex_adversarial
A method for training neural networks that are provably robust to adversarial attacks.
Language:Python37784
huanzhang12/CROWN-IBP
Certified defense to adversarial examples using CROWN and IBP. Also includes GPU implementation of CROWN verification algorithm (in PyTorch).
Language:Python9313
Verified-Intelligence/auto_LiRPA
auto_LiRPA: An Automatic Linear Relaxation based Perturbation Analysis Library for Neural Networks and General Computational Graphs
Language:Python27567
huanzhang12/RecurJac-and-CROWN
Reference implementations for RecurJac, CROWN, FastLin and FastLip (Neural Network verification and robustness certification algorithms) [Do not use this repo, use https://github.com/Verified-Intelligence/auto_LiRPA instead]
Language:Python256
yjhuangcd/FI-ODE
Official implementation for FI-ODE: Certified and Robust Forward Invariance in Neural ODEs.
Language:Python52
liuzuxin/safe-rl-robustness
Code for "On the Robustness of Safe Reinforcement Learning under Observational Perturbations" (ICLR 2023)
Language:Python383
Hadisalman/robust-verify-benchmark
Benchmark for LP-relaxed robustness verification of ReLU-networks
Language:Jupyter Notebook405
cduan2020/LocalizedControl
Controllability analysis, driver placement, and optimal control design on large-scale dynamical networks using the concept of information neighborhood.
Language:MATLAB84
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language:Python1k118
umd-huang-lab/WocaR-RL
Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning
Language:Python231
RobustFieldAutonomyLab/Stochastic_Road_Network
[UR 2023] Robust Route Planning with Distributional Reinforcement Learning in a Stochastic Road Network Environment
Language:Python162
ythuangyt/Robust-Reinforcement-Learning-via-Adversarial-training-with-Langevin-Dynamics
Author implementations of paper "Robust Reinforcement Learning via Adversarial training with Langevin Dynamics"
Language:Python71
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python5k574
zaiyan-x/RFQI
Implementation of Robust Reinforcement Learning using Offline Data [NeurIPS'22]
Language:Python202
tesslerc/ActionRobustRL
Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.09184
Language:Jupyter Notebook3812
nsidn98/Robust-Reinforcement-Learning
Reinforcement Learning CS6700 Course Capstone Project
Language:Python42
huanzhang12/SA_PPO
[NeurIPS 2020 Spotlight] State-adversarial PPO for robust deep reinforcement learning
Language:Python204
chenhongge/StateAdvDRL
[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"
10818
huanzhang12/SA_DDPG
[NeurIPS 2020 Spotlight] State-adversarial DDPG for robust deep reinforcement learning
Language:Python84
nolanwagener/safe_rl
Implementations of SAILR, PDO, and CSC
Language:Python298
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python3.7k652

zhxei1

zhxei1's Stars

lisarah/mdp_path_coordination

thomashirtz/gym-hybrid

LijunSun90/pursuitFSC2

Tviskaron/pogema-baselines

marmotlab/PRIMAL2

heidekrueger/bnelearn

StanfordASL/hj_reachability

SafeRoboticsLab/ISAACS

montrealrobotics/active-domainrand

locuslab/convex_adversarial

huanzhang12/CROWN-IBP

Verified-Intelligence/auto_LiRPA

huanzhang12/RecurJac-and-CROWN

yjhuangcd/FI-ODE

liuzuxin/safe-rl-robustness

Hadisalman/robust-verify-benchmark

cduan2020/LocalizedControl

tinkoff-ai/CORL

umd-huang-lab/WocaR-RL

RobustFieldAutonomyLab/Stochastic_Road_Network

ythuangyt/Robust-Reinforcement-Learning-via-Adversarial-training-with-Langevin-Dynamics

vwxyzjn/cleanrl

zaiyan-x/RFQI

tesslerc/ActionRobustRL

nsidn98/Robust-Reinforcement-Learning

huanzhang12/SA_PPO

chenhongge/StateAdvDRL

huanzhang12/SA_DDPG

nolanwagener/safe_rl

google-deepmind/dm_control