tjuHaoXiaotian

tju student

Pinned Repositories

baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python1 2 00
Deep-Policy-Gradient
Use basic deep reinforcement learning to solve Doom health gathering environment
Language:Jupyter Notebook1 2 00
deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
Language:Python1 2 00
GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
Language:Python31 2 26
ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Language:Python26 3 18
MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
9 4 10
pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
Language:Python113 3 99
Qfamily_for_MatrixGame
We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.
Language:Python14 2 00
RL_paper
5 4 02
SC1
Language:Python18 5 45

tjuHaoXiaotian's Repositories

tjuHaoXiaotian/pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
Language:Python113 3 99
tjuHaoXiaotian/GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
Language:Python31 2 26
tjuHaoXiaotian/ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Language:Python26 3 18
tjuHaoXiaotian/SC1
Language:Python18 5 45
tjuHaoXiaotian/Qfamily_for_MatrixGame
We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.
Language:Python14 2 00
tjuHaoXiaotian/MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
9 4 10
tjuHaoXiaotian/RL_paper
5 4 02
tjuHaoXiaotian/Graph-Neural-Network-Review
1 2 0
tjuHaoXiaotian/InterestDemo
机器学习简单前端（带可视化）小程序
Language:JavaScript1 2 0
tjuHaoXiaotian/NRLPapers
Must-read papers on network representation learning (NRL) / network embedding (NE)
Language:TeX1 2 0
tjuHaoXiaotian/pymarl_alpha
Alpha code release for Python Multi-Agent Reinforcement Learning framework
Language:Python1 2 0
tjuHaoXiaotian/smac
SMAC: The StarCraft Multi-Agent Challenge
Language:Python1 2 0
tjuHaoXiaotian/dien
Language:Python1 0
tjuHaoXiaotian/easy-tf-log
Easy TensorFlow logging for quick prototypes
Language:Python2 0
tjuHaoXiaotian/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python2 0
tjuHaoXiaotian/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Language:Jupyter Notebook1 0
tjuHaoXiaotian/leetcode-master
LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀
1 0
tjuHaoXiaotian/ma-gym
A collection of multi agent environments based on OpenAI gym.
Language:Python1 0
tjuHaoXiaotian/Machine-Learning-Notes
白板推导系列课程笔记初版
1 0
tjuHaoXiaotian/MAgent
A Platform for Many-agent Reinforcement Learning
Language:Python1 0
tjuHaoXiaotian/Markdown4Zhihu
Language:Python2 0
tjuHaoXiaotian/minerl2020_sqil_submission
Language:Python1 0
tjuHaoXiaotian/Paper-Writing-Tips
Paper Writing Tips
1 0
tjuHaoXiaotian/PettingZoo
Gym for multi-agent reinforcement learning
Language:Python1 0
tjuHaoXiaotian/PIC
PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning
Language:Python1 0
tjuHaoXiaotian/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Language:Python1 0
tjuHaoXiaotian/reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Language:Python2 0
tjuHaoXiaotian/smarts_track2
the track2 code of the SMARTS competition of NIPS-22
Language:Python2 0
tjuHaoXiaotian/the-gan-zoo
A list of all named GANs!
Language:Python2 0
tjuHaoXiaotian/tjuHaoXiaotian.github.io
my blog
Language:HTML1 0