Pinned Repositories
GuLiMall
C-Thread-Pool
A minimal but powerful thread pool in ANSI C
CloseAirCombat
An environment based on JSBSIM aimed at one-to-one close air combat.
DRL
Deep Reinforcement Learning
gvisor
Application Kernel for Containers
kilm
leetcode
leetcode 刷题
pymarl
Python Multi-Agent Reinforcement Learning framework
pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
RL4LMs
A modular RL library to fine-tune language models to human preferences
xwqianbei's Repositories
xwqianbei/GuLiMall
xwqianbei/C-Thread-Pool
A minimal but powerful thread pool in ANSI C
xwqianbei/CloseAirCombat
An environment based on JSBSIM aimed at one-to-one close air combat.
xwqianbei/DRL
Deep Reinforcement Learning
xwqianbei/gvisor
Application Kernel for Containers
xwqianbei/kilm
xwqianbei/leetcode
leetcode 刷题
xwqianbei/pymarl
Python Multi-Agent Reinforcement Learning framework
xwqianbei/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
xwqianbei/RL4LMs
A modular RL library to fine-tune language models to human preferences
xwqianbei/trl
Train transformer language models with reinforcement learning.
xwqianbei/UCAS_Algorithm_Design_Course-LYG
UCAS-2022秋季学期计算机算法设计与分析(刘玉贵老师)课程资料总结
xwqianbei/xwqianbei.github.io
xwqianbei/zl_threadpool
Linux平台下C++(C++98、C++03、C++11)实现的线程池