xwqianbei's Stars
zd11024/NaviLLM
[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'
FlagOpen/FlagData
OpenBMB/ModelCenter
Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
Lizhi-sjtu/MARL-code-pytorch
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
philtabor/Multi-Agent-Reinforcement-Learning
PyTorch implementations of MADDPG, MAPPO (coming)
zjunlp/OneGen
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
CraftJarvis/RAT
Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".
qhjqhj00/MemoRAG
Empowering RAG with a memory-based data interface for all-purpose applications!
lich14/CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
NJU-RL/ACORM
Wei9711/GACG
Offical code for Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning
Theohhhu/UPDeT
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)
koulanurag/ma-gym
A collection of multi agent environments based on OpenAI gym.
tobeatraceur/Organized-LLM-Agents
Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source codes for the paper "Embodied LLM Agents Learn to Cooperate in Organized Teams".
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
LAMDA-RL/ODIS
The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
OFA-Sys/InsTag
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
MuQiuJun-AI/bert4pytorch
超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新
649453932/Bert-Chinese-Text-Classification-Pytorch
使用Bert,ERNIE,进行中文文本分类
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
starry-sky6688/MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
TJU-DRL-LAB/AI-Optimizer
The next generation deep reinforcement learning tookit
mmrslwan1110/MARL_Agent_and_ENV
IQL, QMIX, VDN, COMA, QTRAN (QTRAN-Base and QTRAN-Alt), MAVEN, CommNet, DYMA-Cl, G2ANet, and MADDPG