makewaerduo

makewaerduo's Stars

ww-rm/ncmdump-py
A simple package used to dump ncm files to mp3 or flac files.
Language:Python248
tedyli/PEP8-Style-Guide-for-Python-Code
Python 代码风格指南 & 编程规范
9329
haoyun0/BCJH-Metropolis
基于模拟退火的爆炒江湖宴会计算器
Language:C++38490
mrahtz/learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
Language:Python30467
agatheminaro/rl-project-human-preferences
Language:Python22
malayandi/DemPrefCode
Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"
Language:Python105
csmile-1006/PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
Language:Python14717
Lizhi-sjtu/MARL-code-pytorch
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
Language:Python41652
Lizhi-sjtu/DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
Language:Python1k172
binjie09/chatgpt-web
使用 express 和 vue3 搭建的 ChartGPT 演示网页
Language:Vue4.5k795
tinyzqh/light_mappo
Lightweight version of MAPPO to help you quickly migrate to your local environment.
Language:Python47179
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.3k200
HumanCompatibleAI/imitation
Clean PyTorch implementations of imitation and reward learning algorithms
Language:Python1.3k243
ZhiqingXiao/rl-book
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
Language:HTML876319
XinJingHao/PPO-Continuous-Pytorch
A clean and robust Pytorch implementation of PPO on continuous action space.
Language:Python11516