wwxFromTju

Make MAS(DRL) Great Again ! 🐶

DRL/MASTianjin China

Pinned Repositories

OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python3.7k 30 389352
MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Language:Python976 11 156160
AR_tju
北洋AR，建立在天津大学录取通知书上，然后可以将学校的图像显示出来
Language:Objective-C3 1 02
awesome-reinforcement-learning-lib
GitHub's code repository is all you need
333 3 140
awesome-reinforcement-learning-zh
中文整理的强化学习资料（Reinforcement Learning）
2k 79 2358
deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
Language:Python131 7 125
DRL_trick
33 4 18
maddpg-tf
use tensorflow to implement the MADDPG(simple_tag)
Language:Python18 2 25
MARL-101
just for fun
13 2 03
sc2-101-zh
just for fun
Language:Python23 5 26

wwxFromTju's Repositories

wwxFromTju/MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
Language:Python1 0 0
wwxFromTju/AAAI-video-slide
2 0
wwxFromTju/alpha-zero-gomoku
A Multi-threaded Implementation of AlphaZero
wwxFromTju/BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
Language:Python0 0
wwxFromTju/CHER
Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)
wwxFromTju/d2l-pytorch
This project reproduces the book Dive Into Deep Learning (www.d2l.ai), adapting the code from MXNet into PyTorch.
wwxFromTju/difftaichi
10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020)
wwxFromTju/dreamer
Dream to Control: Learning Behaviors by Latent Imagination
Language:Python0 0
wwxFromTju/EITI-EDTI
Influence-Based Multi-Agent Exploration
Language:Python0 0
wwxFromTju/epciclr2020
wwxFromTju/gdrl
Code to go along with the Grokking Deep Reinforcement Learning book
wwxFromTju/MetaIRL
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
wwxFromTju/modular-assemblies
[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"
wwxFromTju/Multi-Agent-Reinforcement-Learning-Environment
Hello, I pushed some python environments for Multi Agent Reinforcement Learning.
wwxFromTju/mushroom-rl
Python library for Reinforcement Learning experiments.
wwxFromTju/muzero-pytorch
Pytorch Implementation of MuZero
Language:Python0 0
wwxFromTju/once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
wwxFromTju/p3s
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
Language:Python0 0
wwxFromTju/PARL
PARL A high-performance distributed training framework for Reinforcement Learning
wwxFromTju/policy_transfer
wwxFromTju/pymarl_alpha
Alpha code release for Python Multi-Agent Reinforcement Learning framework
Language:Python1 0
wwxFromTju/pytorch_metric_learning
A flexible and extensible metric learning library, written in PyTorch.
Language:Python0 02
wwxFromTju/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
wwxFromTju/rela
Reinforcement Learning Assembly
Language:C++0 0
wwxFromTju/SEPT
Single Episode Policy Transfer in Reinforcement Learning
Language:Python1 0
wwxFromTju/smac
SMAC: The StarCraft Multi-Agent Challenge
Language:Python
wwxFromTju/SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
wwxFromTju/train-procgen
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
Language:Python0 0
wwxFromTju/TRGPPO
wwxFromTju/zhusuan
A library for Bayesian deep learning, generative models, based on Tensorflow