02KAI's Repositories
02KAI/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
02KAI/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
02KAI/self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.