wwxFromTju

Make MAS(DRL) Great Again ! 🐶

DRL/MASTianjin China

wwxFromTju's Stars

ray-project/llm-numbers
Numbers every LLM developer should know
4.1k 59 17141
alpa-projects/alpa
Training and serving large-scale neural networks with auto parallelization.
Language:Python3.1k 46 297360
eureka-research/Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
Language:Jupyter Notebook2.9k 25 39257
google-deepmind/mujoco_menagerie
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
Language:Python1.6k 30 81234
openai/Video-Pre-Training
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Language:Python1.4k 27 34143
google-deepmind/rlax
Language:Python1.3k 34 2688
facebookresearch/shumai
Fast Differentiable Tensor Library in JavaScript and TypeScript with Bun + Flashlight
Language:TypeScript1.1k 101 3526
d4nj1/TLPUI
A GTK user interface for TLP written in Python
Language:Python1.1k 24 11383
tinkoff-ai/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Language:Python1.1k 17 28134
pytorch/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
Language:Python1k 45 567124
instadeepai/jumanji
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
Language:Python672 13 10485
Azure/MS-AMP
Microsoft Automatic Mixed Precision Library
Language:Python542 11 6745
salesforce/CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
Language:Python504 18 2961
FLAIROx/JaxMARL
Multi-Agent Reinforcement Learning with JAX
Language:Python476 10 4093
google-deepmind/alphastar
Language:Python435 11 659
wwxFromTju/awesome-reinforcement-learning-lib
GitHub's code repository is all you need
333 3 140
MineDojo/MineCLIP
Foundation Model for MineDojo
Language:Python249 11 1432
floodsung/LLM-with-RL-papers
A collection of LLM with RL papers
238 8 39
OrigamiDream/gato
Unofficial Gato: A Generalist Agent
Language:Python206 14 430
sotopia-lab/sotopia
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
Language:Python175 3 7423
microsoft/MoCapAct
A Multi-Task Dataset for Simulated Humanoid Control
Language:Python174 12 1023
google-deepmind/s6
Language:C++146 7 011
chandar-lab/RLHive
Language:Python100 9 969
architsharma97/dpo-rlaif
Language:Jupyter Notebook93 3 410
taichi-dev/faster-python-with-taichi
Language:Python80 5 46
gilzamir18/AI4U
AI4U is a plugin that allows you use the Godot Game Engine to specify agents with reinforcement learning. Non-Player Characters (NPCs) of games can be designed using ready-made components.
Language:C#67 8 1911
kvfrans/powderworld
Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
Language:Python63 5 18
manantomar/Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
Language:Python37 2 13
NVlabs/easysim
A library for creating Gym environments with unified API to various physics simulators
Language:Python31 6 07
perrin-isir/xpag
a modular reinforcement learning library with JAX agents
Language:Python22 4 45

wwxFromTju

wwxFromTju's Stars

ray-project/llm-numbers

alpa-projects/alpa

eureka-research/Eureka

google-deepmind/mujoco_menagerie

openai/Video-Pre-Training

google-deepmind/rlax

facebookresearch/shumai

d4nj1/TLPUI

tinkoff-ai/CORL

pytorch/torchdynamo

instadeepai/jumanji

Azure/MS-AMP

salesforce/CodeRL

FLAIROx/JaxMARL

google-deepmind/alphastar

wwxFromTju/awesome-reinforcement-learning-lib

MineDojo/MineCLIP

floodsung/LLM-with-RL-papers

OrigamiDream/gato

sotopia-lab/sotopia

microsoft/MoCapAct

google-deepmind/s6

chandar-lab/RLHive

architsharma97/dpo-rlaif

taichi-dev/faster-python-with-taichi

gilzamir18/AI4U

kvfrans/powderworld

manantomar/Mirror-Descent-Policy-Optimization

NVlabs/easysim

perrin-isir/xpag