WilliamWu96

Robustness and Generalization of RL, PhD at the University of Liverpool

WilliamWu96's Stars

tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language: Python 1.1k 124
zhoubolei/bolei_awesome_posters
CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!
1.4k 126
Ericonaldo/ILSwiss
ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (template) in PyTorch.
Language: Python 16112
metadriverse/policydissect
[NeurIPS 2022] Official implementation of the paper: "Human-AI Shared Control via Policy Dissection"
Language: Python 485
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.2k 200
PlexPt/awesome-chatgpt-prompts-zh
A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to get it to do what you ask.
52.2k 13.5k
Cranial-XIX/metric-residual-network
Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
Language: C++ 14
gwthomas/force
A library for reinforcement learning research
Language: Python 548
hmhyau/rl-intention
Language: Python 8
chenhongge/StateAdvDRL
[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"
11018
Plankson/awesome-explainable-reinforcement-learning
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
20221
amirhosseinzlf/STARLA
Search-based Testing Approach of Reinforcement Learning Agent
Language: Jupyter Notebook 91
denisyarats/exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
Language: Python 1008
rll-research/url_benchmark
Language: Python 32850
google/uncertainty-baselines
High-quality implementations of standard and SOTA methods on a variety of tasks.
Language: Python 1.4k 202
HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
Language: Python 1.3k 240
YangRui2015/Model-basedHER
Model-based Hindsight Experience Replay
Language: Python 84
clvrai/goal_prox_il
Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021)
Language: Python 221
chrhenning/posterior_replay_cl
Continual learning of task-specific approximations of the parameter posterior distribution via a shared hypernetwork.
Language: Python 163
optimass/continual_learning_papers
Relevant papers in Continual Learning
Language: TeX 71082
google-research/google-research
Google Research
Language: Jupyter Notebook 33.8k 7.8k
snu-mllab/DCPG
Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)
Language: Python 144
rraileanu/idaac
Language: Python 5314
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python 3.6k 831
rraileanu/auto-drac
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
Language: Python 10118
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language: Python 3.9k 841
Lifelong-ML/LPG-FTW
Language: Python 203
famura/SimuRLacra
reinforcement learning from randomized simulations
Language: Python 6410
lifelong-learning-systems/tella
Framework for Training & Evaluating Lifelong Learning Agents (TELLA)
Language: Python 32
GilgameshD/GRADER
This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"
Language: Python 304
