zyfsjycc

zyfsjycc's Stars

DarkDawn233/SeCA
Codes of SeCA accompanying the paper "Sequential Cooperative Multi-Agent Reinforcement Learning"(AAMAS 2023). SeCA is a sequential credit assignment method that factorizes and simplifies the complex interaction analysis of multi-agent systems into a sequential evaluation process for more efficient learning.
Language:Python53
zyfsjycc/GoMARL
Codes of GoMARL accompanying the paper "Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning"(NeurIPS 2023). GoMARL is a domain-agnostic MARL method that learns automatic grouping for efficient cooperation by promoting intra- and inter-group coordination.
Language:Python213
rpSebastian/PDCFRPlus
Code for "Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent", IJCAI 2024 (Oral)
Language:Python91
rpSebastian/DDCFR
Code for "Dynamic Discounted Counterfactual Regret Minimization", ICLR 2024 (Spotlight)
Language:Python71
rpSebastian/AutoCFR
Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)
Language:Python163
wwxFromTju/deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
Language:Python13125
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
Language:Python2.7k161
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python57.5k5.9k
marl-book/codebase
Official code repo for the MARL book (www.marl-book.com)
Language:Python41264
OpenRL-Lab/openrl
Unified Reinforcement Learning Framework
Language:Python66163
opendilab/PPOxFamily
PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）
Language:Python2k179
micahcarroll/uniMASK
Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"
Language:Python544
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Language:Python968158
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python9.4k1.7k
wwxFromTju/awesome-reinforcement-learning-lib
GitHub's code repository is all you need
33238
vwxyzjn/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
Language:Python66499
TJU-DRL-LAB/AI-Optimizer
The next generation deep reinforcement learning tookit
Language:Python4.9k903
acmi-lab/cmu-10717-the-art-of-the-paper
Official repository for CMU Machine Learning Department's 10717: "The Art of the Paper".
28610
SerpentAI/SerpentAI
Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!
Language:Python6.8k788
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Language:Python3.2k382
uoe-agents/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
Language:Python540144
datawhalechina/fantastic-matplotlib
Matplotlib中文教程，在线阅读地址：https://datawhalechina.github.io/fantastic-matplotlib/
Language:Python471105
datawhalechina/easy-rl
强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/
Language:Jupyter Notebook9.8k1.9k
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python3.9k680
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Language:Jupyter Notebook8.4k854
mli/paper-reading
深度学习经典、新论文逐段精读
27.6k2.5k
hahayonghuming/VDACs
Value-Decomposition Multi-Agent Actor-Critics
Language:Python403
probml/pml-book
"Probabilistic Machine Learning" - a book series by Kevin Murphy
Language:Jupyter Notebook5k599
google/brax
Massively parallel rigidbody physics simulation on accelerator hardware.
Language:Jupyter Notebook2.4k259
h4pZ/rose-pine-matplotlib
All natural pine, faux fur and a bit of soho vibes for the classy minimalist
31515