02KAI

Pinned Repositories

on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Language:Python0 0 00
pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python0 0 00
RL_study
Project
Language:Python2 1 00
self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
Language:Python0 0 00

02KAI's Repositories

02KAI/RL_study
Project
Language:Python2 1 00
02KAI/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Language:Python0 0 00
02KAI/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python0 0 00
02KAI/self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
Language:Python0 0 00