ZQ2413262560

hierarchical reinforcement learning

Tsinghua University, Shenzhen

ZQ2413262560's Stars

zlr20/saferl_kit
Language:Python565
Egg-Hu/PURER
Official Pytorch Implementation for "Architecture, Dataset and Model-Scale Agnostic Data-free Meta-Learning" (CVPR-2023)
Language:Python6
Egg-Hu/PURER-Plus
PURER-Plus: An Extension of PURER (CVPR-2023)
Language:Python3
Egg-Hu/BiDf-MKD
Official Pytorch Implementation for "Learning to Learn from APIs: Black-Box Data-Free Meta-Learning" (ICML-2023)
Language:Python10
SvenGronauer/RL-Safety-Algorithms
Implementations of safe reinforcement learning algorithms
Language:Python194
nikhilbarhate99/min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym
Language:Python24224
PKU-Alignment/safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Language:Python37752
2019ChenGong/RL-Paper-notes
28829
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language: Python, 34.8k stars, 4.1k forks
binary-husky/gpt_academic
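Provides a practical interactive interface for LLMs such as GPT and GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.
Language: Python, 64k stars, 7.9k forks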
PKU-Alignment/omnisafe
[JMLR] OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language:Python903129
opendilab/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
67125
akjayant/PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
Language:Python3410
liuzuxin/cvpo-safe-rl
Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)
Language:Python637
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language: Python, 5.6k stars, 1.2k forks
chauncygu/Safe-Reinforcement-Learning-Baselines
The repository is for safe reinforcement learning baselines.
Language:Jupyter Notebook46775
ShangtongZhang/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
Language: Python, 3.2k stars, 679 forks
SvenGronauer/Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
Language:Python6113
AI4Finance-Foundation/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language: Python, 3.6k stars, 833 forks
MorvanZhou/Reinforcement-learning-with-tensorflow
Simple reinforcement learning tutorials, 莫烦Python (Morvan Python) AI teaching in Chinese
Language: Python, 8.8k stars, 5k forks
dnddnjs/feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
Language:Python9210
haarnoja/sac
Soft Actor-Critic
Language:Python970233
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
4k stars, 720 forks
zhoubolei/introRL
Intro to Reinforcement Learning (Outline of Reinforcement Learning)
3.2k stars, 483 forks
