Pinned Repositories
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
AR_tju
Beiyang AR: an augmented-reality app built on the Tianjin University admission letter that displays images of the university
awesome-reinforcement-learning-lib
GitHub's code repository is all you need
awesome-reinforcement-learning-zh
Reinforcement learning resources compiled in Chinese
deepmind_MAS_enviroment
Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Cooperative Multi-Agent Learning"
DRL_trick
maddpg-tf
A TensorFlow implementation of MADDPG (simple_tag)
MARL-101
just for fun
sc2-101-zh
just for fun
wwxFromTju's Repositories
wwxFromTju/MAVEN
Submission for MAVEN: Multi-Agent Variational Exploration
wwxFromTju/AAAI-video-slide
wwxFromTju/alpha-zero-gomoku
A Multi-threaded Implementation of AlphaZero
wwxFromTju/BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
wwxFromTju/CHER
Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)
wwxFromTju/d2l-pytorch
This project reproduces the book Dive Into Deep Learning (www.d2l.ai), adapting the code from MXNet into PyTorch.
wwxFromTju/difftaichi
10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020)
wwxFromTju/dreamer
Dream to Control: Learning Behaviors by Latent Imagination
wwxFromTju/EITI-EDTI
Influence-Based Multi-Agent Exploration
wwxFromTju/epciclr2020
wwxFromTju/gdrl
Code to go along with the Grokking Deep Reinforcement Learning book
wwxFromTju/MetaIRL
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
wwxFromTju/modular-assemblies
[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"
wwxFromTju/Multi-Agent-Reinforcement-Learning-Environment
Hello, I pushed some Python environments for Multi-Agent Reinforcement Learning.
wwxFromTju/mushroom-rl
Python library for Reinforcement Learning experiments.
wwxFromTju/muzero-pytorch
PyTorch implementation of MuZero
wwxFromTju/once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
wwxFromTju/p3s
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
wwxFromTju/PARL
PARL: A high-performance distributed training framework for Reinforcement Learning
wwxFromTju/policy_transfer
wwxFromTju/pymarl_alpha
Alpha code release for Python Multi-Agent Reinforcement Learning framework
wwxFromTju/pytorch_metric_learning
A flexible and extensible metric learning library, written in PyTorch.
wwxFromTju/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
wwxFromTju/rela
Reinforcement Learning Assembly
wwxFromTju/SEPT
Single Episode Policy Transfer in Reinforcement Learning
wwxFromTju/smac
SMAC: The StarCraft Multi-Agent Challenge
wwxFromTju/SV-RL
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
wwxFromTju/train-procgen
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
wwxFromTju/TRGPPO
wwxFromTju/zhusuan
A library for Bayesian deep learning and generative models, based on TensorFlow