xwqianbei

Pinned Repositories

C-Thread-Pool
A minimal but powerful thread pool in ANSI C
Language:C00
CloseAirCombat
An environment based on JSBSIM aimed at one-to-one close air combat.
Language:Python00
DRL
Deep Reinforcement Learning
00
GuLiMall
20
gvisor
Application Kernel for Containers
Language:Go00
kilm
Language:Python00
leetcode
leetcode 刷题
Language:C++00
pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python00
pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python00
RL4LMs
A modular RL library to fine-tune language models to human preferences
Language:Python00

xwqianbei's Repositories

xwqianbei/GuLiMall
20
xwqianbei/C-Thread-Pool
A minimal but powerful thread pool in ANSI C
Language:C00
xwqianbei/CloseAirCombat
An environment based on JSBSIM aimed at one-to-one close air combat.
Language:Python00
xwqianbei/DRL
Deep Reinforcement Learning
00
xwqianbei/gvisor
Application Kernel for Containers
Language:Go00
xwqianbei/kilm
Language:Python00
xwqianbei/leetcode
leetcode 刷题
Language:C++00
xwqianbei/pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python00
xwqianbei/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python00
xwqianbei/RL4LMs
A modular RL library to fine-tune language models to human preferences
Language:Python00
xwqianbei/trl
Train transformer language models with reinforcement learning.
xwqianbei/UCAS_Algorithm_Design_Course-LYG
UCAS-2022秋季学期计算机算法设计与分析(刘玉贵老师)课程资料总结
xwqianbei/xwqianbei.github.io
Language:HTML
xwqianbei/zl_threadpool
Linux平台下C++(C++98、C++03、C++11)实现的线程池