zyfsjycc's Stars
DarkDawn233/SeCA
Code for SeCA, accompanying the paper "Sequential Cooperative Multi-Agent Reinforcement Learning" (AAMAS 2023). SeCA is a sequential credit assignment method that factorizes and simplifies the complex interaction analysis of multi-agent systems into a sequential evaluation process for more efficient learning.
zyfsjycc/GoMARL
Code for GoMARL, accompanying the paper "Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning" (NeurIPS 2023). GoMARL is a domain-agnostic MARL method that learns automatic grouping for efficient cooperation by promoting intra- and inter-group coordination.
rpSebastian/PDCFRPlus
Code for "Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent", IJCAI 2024 (Oral)
rpSebastian/DDCFR
Code for "Dynamic Discounted Counterfactual Regret Minimization", ICLR 2024 (Spotlight)
rpSebastian/AutoCFR
Code for "AutoCFR: Learning to Design Counterfactual Regret Minimization Algorithms", AAAI 2022 (Oral)
wwxFromTju/deepmind_MAS_enviroment
Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Cooperative Multi-Agent Learning"
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
marl-book/codebase
Official code repo for the MARL book (www.marl-book.com)
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
opendilab/PPOxFamily
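PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons to help you sort out the algorithm theory, straighten out the code logic, and put decision AI into practice)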
micahcarroll/uniMASK
Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
wwxFromTju/awesome-reinforcement-learning-lib
GitHub's code repository is all you need
vwxyzjn/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
TJU-DRL-LAB/AI-Optimizer
The next generation deep reinforcement learning toolkit
acmi-lab/cmu-10717-the-art-of-the-paper
Official repository for CMU Machine Learning Department's 10717: "The Art of the Paper".
SerpentAI/SerpentAI
Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
uoe-agents/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
datawhalechina/fantastic-matplotlib
Matplotlib tutorial in Chinese; read online at https://datawhalechina.github.io/fantastic-matplotlib/
datawhalechina/easy-rl
Reinforcement learning tutorial in Chinese (the "Mushroom Book" 🍄); read online at https://datawhalechina.github.io/easy-rl/
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
mli/paper-reading
Paragraph-by-paragraph close reading of classic and recent deep learning papers
hahayonghuming/VDACs
Value-Decomposition Multi-Agent Actor-Critics
probml/pml-book
"Probabilistic Machine Learning" - a book series by Kevin Murphy
google/brax
Massively parallel rigid-body physics simulation on accelerator hardware.
h4pZ/rose-pine-matplotlib
All natural pine, faux fur and a bit of soho vibes for the classy minimalist