makewaerduo's Stars
ww-rm/ncmdump-py
A simple package used to dump ncm files to mp3 or flac files.
tedyli/PEP8-Style-Guide-for-Python-Code
Python code style guide & programming conventions
haoyun0/BCJH-Metropolis
A banquet calculator for 爆炒江湖 (BCJH) based on simulated annealing
mrahtz/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
agatheminaro/rl-project-human-preferences
malayandi/DemPrefCode
Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"
csmile-1006/PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (accepted at ICLR 2023)
Lizhi-sjtu/MARL-code-pytorch
Concise PyTorch implementations of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
Lizhi-sjtu/DRL-code-pytorch
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.
binjie09/chatgpt-web
A ChatGPT demo web page built with Express and Vue 3
tinyzqh/light_mappo
Lightweight version of MAPPO to help you quickly migrate to your local environment.
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
ZhiqingXiao/rl-book
Source code for the book "Reinforcement Learning: Theory and Python Implementation"
XinJingHao/PPO-Continuous-Pytorch
A clean and robust PyTorch implementation of PPO on continuous action spaces.