Pinned Repositories
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Deep-Policy-Gradient
Use basic deep reinforcement learning to solve Doom health gathering environment
deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
Qfamily_for_MatrixGame
We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.
RL_paper
SC1
tjuHaoXiaotian's Repositories
tjuHaoXiaotian/pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
tjuHaoXiaotian/GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
tjuHaoXiaotian/ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
tjuHaoXiaotian/SC1
tjuHaoXiaotian/Qfamily_for_MatrixGame
We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.
tjuHaoXiaotian/MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
tjuHaoXiaotian/RL_paper
tjuHaoXiaotian/Graph-Neural-Network-Review
tjuHaoXiaotian/InterestDemo
机器学习简单前端(带可视化)小程序
tjuHaoXiaotian/NRLPapers
Must-read papers on network representation learning (NRL) / network embedding (NE)
tjuHaoXiaotian/pymarl_alpha
Alpha code release for Python Multi-Agent Reinforcement Learning framework
tjuHaoXiaotian/smac
SMAC: The StarCraft Multi-Agent Challenge
tjuHaoXiaotian/dien
tjuHaoXiaotian/easy-tf-log
Easy TensorFlow logging for quick prototypes
tjuHaoXiaotian/gym
A toolkit for developing and comparing reinforcement learning algorithms.
tjuHaoXiaotian/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
tjuHaoXiaotian/leetcode-master
LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
tjuHaoXiaotian/ma-gym
A collection of multi agent environments based on OpenAI gym.
tjuHaoXiaotian/Machine-Learning-Notes
白板推导系列课程笔记 初版
tjuHaoXiaotian/MAgent
A Platform for Many-agent Reinforcement Learning
tjuHaoXiaotian/Markdown4Zhihu
tjuHaoXiaotian/minerl2020_sqil_submission
tjuHaoXiaotian/Paper-Writing-Tips
Paper Writing Tips
tjuHaoXiaotian/PettingZoo
Gym for multi-agent reinforcement learning
tjuHaoXiaotian/PIC
PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning
tjuHaoXiaotian/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
tjuHaoXiaotian/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
tjuHaoXiaotian/smarts_track2
the track2 code of the SMARTS competition of NIPS-22
tjuHaoXiaotian/the-gan-zoo
A list of all named GANs!
tjuHaoXiaotian/tjuHaoXiaotian.github.io
my blog