Pinned Repositories
TimeChamber
A Massively Parallel Large Scale Self-Play Framework
acm-challenge-workbook
《挑战程序设计竞赛》习题册攻略
AdversarialNetsPapers
The classical paper list with code about generative adversarial nets
awesome
😎 Awesome lists about all kinds of interesting topics
awesome-public-datasets
A topic-centric list of HQ open datasets. PR ☛☛☛
Awesome-PyTorch-Chinese
【干货】史上最全的PyTorch学习资源汇总
CRL
DeepMARL-PyTorch
Reinforcement Learning Codes
GRIP_Plus_Plus
rl_games
RL implementations
ZiyiLiubird's Repositories
ZiyiLiubird/GRIP_Plus_Plus
ZiyiLiubird/DeepMARL-PyTorch
Reinforcement Learning Codes
ZiyiLiubird/rl_games
RL implementations
ZiyiLiubird/CRL
ZiyiLiubird/Demo
Demo repo for tutotial articles on Opensource.com
ZiyiLiubird/EVO-PopulationBasedTraining
Population-Based Training (PBT) for Reinforcement Learning using Message Passing Interface (MPI)
ZiyiLiubird/RejectSampling
ZiyiLiubird/RPG
This is the source code of RPG (Reward-Randomized Policy Gradient)
ZiyiLiubird/SRPPO
ZiyiLiubird/tianshou
An elegant PyTorch deep reinforcement learning library.
ZiyiLiubird/A1-QP-MPC-Controller
An open source implementation of MIT Cheetah 3 controllers
ZiyiLiubird/AgentLite
Customized AgentLite
ZiyiLiubird/AI-Toolbox
A C++ framework for MDPs and POMDPs with Python bindings
ZiyiLiubird/CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
ZiyiLiubird/Competition_Football
ZiyiLiubird/Competition_Olympics-Integrated
ZiyiLiubird/ContrastiveReflexion
ZiyiLiubird/CoPO
[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".
ZiyiLiubird/cppjson
a light json lib using c++20
ZiyiLiubird/crazyflie-clients-python
Host applications and library for Crazyflie written in Python.
ZiyiLiubird/Cxx_HOPL4_zh
Chinese translation of Bjarne Stroustrup's HOPL4 paper
ZiyiLiubird/football
Check out the new game server:
ZiyiLiubird/Gym-Stag-Hunt
A custom OpenAI Gym environment that implements various Stag Hunt games for reinforcement learning experiments.
ZiyiLiubird/LangChain_Examples
ZiyiLiubird/open-instruct
ZiyiLiubird/OpenRLHF
A Ray-based High-performance RLHF framework (support 70B+ models)
ZiyiLiubird/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
ZiyiLiubird/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
ZiyiLiubird/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
ZiyiLiubird/ZiyiLiubird.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes