ZQ2413262560's Stars
zlr20/saferl_kit
Egg-Hu/PURER
Official Pytorch Implementation for "Architecture, Dataset and Model-Scale Agnostic Data-free Meta-Learning" (CVPR-2023)
Egg-Hu/PURER-Plus
PURER-Plus: An Extension of PURER (CVPR-2023)
Egg-Hu/BiDf-MKD
Official Pytorch Implementation for "Learning to Learn from APIs: Black-Box Data-Free Meta-Learning" (ICML-2023)
SvenGronauer/RL-Safety-Algorithms
Implementations of safe reinforcement learning algorithms
nikhilbarhate99/min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym
PKU-Alignment/safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
2019ChenGong/RL-Paper-notes
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
binary-husky/gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
PKU-Alignment/omnisafe
[JMLR] OmniSafe is an infrastructural framework for accelerating SafeRL research.
opendilab/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
akjayant/PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
liuzuxin/cvpo-safe-rl
Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
chauncygu/Safe-Reinforcement-Learning-Baselines
The repository is for safe reinforcement learning baselines.
ShangtongZhang/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
SvenGronauer/Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
MorvanZhou/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学
dnddnjs/feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
haarnoja/sac
Soft Actor-Critic
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
zhoubolei/introRL
Intro to Reinforcement Learning (强化学习纲要)