RZ-Q's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
mistralai/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Blizzard/s2client-proto
StarCraft II Client - protocol definitions used to communicate with StarCraft II.
google-research/football
Check out the new game server:
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
openai/multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
openai/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
TradeMaster-NTU/TradeMaster
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
facebookresearch/mbrl-lib
Library for Model Based RL
PKU-MARL/DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
starry-sky6688/MADDPG
PyTorch implementation of the MARL algorithm MADDPG, which corresponds to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".
marlbenchmark/off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
shacklettbp/madrona
SilenceEagle/paper_downloader
Download papers and supplemental materials from open-access paper websites such as AAAI, AISTATS, COLT, CORL, CVPR, ECCV, ICCV, ICLR, ICML, IJCAI, JMLR, NIPS, RSS, and WACV.
oxwhirl/smacv2
cyanrain7/TRPO-in-MARL
oxwhirl/wqmix
Code for Weighted QMIX
TonghanWang/DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
bic4907/Overcooked-AI
Offline Multi-Agent Reinforcement Learning Implementations: Solving the Overcooked Game with a Data-Driven Method
TonghanWang/CASEC-MACO-benchmark
Codes accompanying the paper "Context-Aware Sparse Deep Coordination Graphs" (https://arxiv.org/abs/2106.02886).
MarcDV1999/overcooked-explainability
Application of an RL explainability method based on the construction of a Policy Graph that represents the agent's behaviour in a multi-agent RL cooperative environment (Overcooked)
RZ-Q/MARLHF
Multi-agent RLHF/PbRL
sgzZ123/GRE