xwqianbei

xwqianbei's Stars

zd11024/NaviLLM
[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'
Language:Python1267
FlagOpen/FlagData
Language:Python27130
OpenBMB/ModelCenter
Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
Language:Python24330
Lizhi-sjtu/MARL-code-pytorch
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
Language:Python45061
philtabor/Multi-Agent-Reinforcement-Learning
PyTorch implementations of MADDPG, MAPPO (coming)
Language:Python8914
zjunlp/OneGen
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
Language:Python13815
CraftJarvis/RAT
Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".
Language:Python18323
qhjqhj00/MemoRAG
Empowering RAG with a memory-based data interface for all-purpose applications!
Language:Python1.3k81
lich14/CDS
[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.
Language:Python8419
NJU-RL/ACORM
Language:Python254
Wei9711/GACG
Offical code for Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning
Language:Python143
Theohhhu/UPDeT
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)
Language:Python12917
koulanurag/ma-gym
A collection of multi agent environments based on OpenAI gym.
Language:Python575103
tobeatraceur/Organized-LLM-Agents
Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source codes for the paper "Embodied LLM Agents Learn to Cooperate in Organized Teams".
Language:Python335
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
4.1k729
LAMDA-RL/ODIS
The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".
Language:Python385
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
Language:Python898142
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language:Python2.4k448
oxwhirl/pymarl
Python Multi-Agent Reinforcement Learning framework
Language:Python1.9k387
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Language:Python946153
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:Python12.7k868
OFA-Sys/InsTag
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
2207
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python633124
MuQiuJun-AI/bert4pytorch
超轻量级bert的pytorch版本，大量中文注释，容易修改结构，持续更新
Language:Python40967
649453932/Bert-Chinese-Text-Classification-Pytorch
使用Bert，ERNIE，进行中文文本分类
Language:Python4.1k902
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python5.8k656
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Language:Python1.5k189
starry-sky6688/MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
Language:Python1.5k284
TJU-DRL-LAB/AI-Optimizer
The next generation deep reinforcement learning tookit
Language:Python4.8k902
mmrslwan1110/MARL_Agent_and_ENV
IQL, QMIX, VDN, COMA, QTRAN (QTRAN-Base and QTRAN-Alt), MAVEN, CommNet, DYMA-Cl, G2ANet, and MADDPG
Language:Python164