RZ-Q

Shitty codes maker

RZ-Q's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python166k 1.6k 2.6k44.1k
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python132k 1.1k 15.7k26.3k
mistralai/mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
Language:Jupyter Notebook8.8k 116 115761
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language:Jupyter Notebook7.6k 108 291468
Blizzard/s2client-proto
StarCraft II Client - protocol definitions used to communicate with StarCraft II.
Language:Python3.8k 196 124430
google-research/football
Check out the new game server:
Language:Python3.3k 94 3181.3k
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
Language:Jupyter Notebook2.8k 25 37249
Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
Language:Python2.5k 19 369406
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python2.3k 177 82785
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language:Python2.1k 39 188602
openai/multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
Language:Python1.6k 187 31302
openai/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python1.6k 151 67485
TradeMaster-NTU/TradeMaster
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning :fire: :zap: :rainbow:
Language:Jupyter Notebook1.3k 39 66268
THUDM/ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Language:Python1.1k 14 8460
facebookresearch/mbrl-lib
Library for Model Based RL
Language:Python952 25 67154
PKU-MARL/DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
Language:Python619 13 4170
BlackSamorez/tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
Language:Python614 8 6638
starry-sky6688/MADDPG
Pytorch implementation of the MARL algorithm, MADDPG, which correspondings to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".
Language:Python514 5 4180
marlbenchmark/off-policy
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
Language:Python387 3 1267
shacklettbp/madrona
Language:C++300 9 1130
SilenceEagle/paper_downloader
Download papers and supplemental materials from open-access paper website, such as AAAI, AISTATS, COLT, CORL, CVPR, ECCV, ICCV, ICLR, ICML, IJCAI, JMLR, NIPS, RSS, WACV.
Language:Python229 5 531
oxwhirl/smacv2
Language:Python195 5 3229
cyanrain7/TRPO-in-MARL
Language:Python180 3 1948
oxwhirl/wqmix
Code for Weighted QMIX
Language:Python119 4 734
TonghanWang/DOP
Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)
Language:Python51 2 716
bic4907/Overcooked-AI
Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method
Language:Python31 2 24
TonghanWang/CASEC-MACO-benchmark
Codes accompanying the paper "Context-Aware Sparse Deep Coordination Graphs (https://arxiv.org/abs/2106.02886).
Language:Python14 1 24
MarcDV1999/overcooked-explainability
Application of a RL explainability method based on the construction of a Policy Graph that represents the agent's behaviour in a multi-agent RL cooperative environment (Overcooked)
Language:Python4 2 01
RZ-Q/MARLHF
Multi-agent RLHF/PbRL
Language:Python40
sgzZ123/GRE
11

RZ-Q

RZ-Q's Stars

Significant-Gravitas/AutoGPT

huggingface/transformers

mistralai/mistral-src

01-ai/Yi

Blizzard/s2client-proto

google-research/football

eureka-research/Eureka

Farama-Foundation/PettingZoo

openai/multiagent-particle-envs

Farama-Foundation/Minigrid

openai/multi-agent-emergence-environments

openai/maddpg

TradeMaster-NTU/TradeMaster

THUDM/ImageReward

facebookresearch/mbrl-lib

PKU-MARL/DexterousHands

BlackSamorez/tensor_parallel

starry-sky6688/MADDPG

marlbenchmark/off-policy

shacklettbp/madrona

SilenceEagle/paper_downloader

oxwhirl/smacv2

cyanrain7/TRPO-in-MARL

oxwhirl/wqmix

TonghanWang/DOP

bic4907/Overcooked-AI

TonghanWang/CASEC-MACO-benchmark

MarcDV1999/overcooked-explainability

RZ-Q/MARLHF

sgzZ123/GRE