WilliamWu96's Stars
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
zhoubolei/bolei_awesome_posters
CVPR and NeurIPS poster examples and templates. May we have in-person poster sessions soon!
Ericonaldo/ILSwiss
ILSwiss is an easy-to-run Imitation Learning (IL, also known as Learning from Demonstration, LfD) and Reinforcement Learning (RL) framework (template) in PyTorch.
metadriverse/policydissect
[NeurIPS 2022] Official implementation of the paper: "Human-AI Shared Control via Policy Dissection"
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
PlexPt/awesome-chatgpt-prompts-zh
A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to make it do what you ask.
Cranial-XIX/metric-residual-network
Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
gwthomas/force
A library for reinforcement learning research
hmhyau/rl-intention
chenhongge/StateAdvDRL
[NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"
Plankson/awesome-explainable-reinforcement-learning
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
amirhosseinzlf/STARLA
Search-based Testing Approach of Reinforcement Learning Agent
denisyarats/exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
rll-research/url_benchmark
google/uncertainty-baselines
High-quality implementations of standard and SOTA methods on a variety of tasks.
HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
YangRui2015/Model-basedHER
Model-based Hindsight Experience Replay
clvrai/goal_prox_il
Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021)
chrhenning/posterior_replay_cl
Continual learning of task-specific approximations of the parameter posterior distribution via a shared hypernetwork.
optimass/continual_learning_papers
Relevant papers in Continual Learning
google-research/google-research
Google Research
snu-mllab/DCPG
Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)
rraileanu/idaac
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
rraileanu/auto-drac
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Lifelong-ML/LPG-FTW
famura/SimuRLacra
Reinforcement learning from randomized simulations
lifelong-learning-systems/tella
Framework for Training & Evaluating Lifelong Learning Agents (TELLA)
GilgameshD/GRADER
This is the official implementation of the NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"