Pinned Repositories
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Deep-Policy-Gradient
Uses basic deep reinforcement learning to solve the Doom health-gathering environment
deepmind_MAS_enviroment
Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Cooperative Multi-Agent Learning"
GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
ICML-2020-MSBCB
Code for the ICML 2020 paper "Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising"
MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation-invariance and permutation-equivariance properties. The enhanced algorithms achieve 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
Qfamily_for_MatrixGame
We provide a very simple implementation of typical value-decomposition methods for solving single-state matrix games.
RL_paper
SC1
tjuHaoXiaotian's Repositories
tjuHaoXiaotian/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
tjuHaoXiaotian/Deep-Policy-Gradient
Uses basic deep reinforcement learning to solve the Doom health-gathering environment
tjuHaoXiaotian/deepmind_MAS_enviroment
Some multi-agent environments from "Multi-agent Reinforcement Learning in Sequential Social Dilemmas" and "Value-Decomposition Networks For Cooperative Multi-Agent Learning"
tjuHaoXiaotian/DQfD
An implementation of DQfD (Deep Q-learning from Demonstrations), proposed by DeepMind in "Learning from Demonstrations for Real World Reinforcement Learning"
tjuHaoXiaotian/dqn-hfo
tjuHaoXiaotian/improved_wgan_training
Code for reproducing experiments in "Improved Training of Wasserstein GANs"
tjuHaoXiaotian/maddpg
tjuHaoXiaotian/pysc2-examples
StarCraft II - pysc2 Deep Reinforcement Learning Examples
tjuHaoXiaotian/pysc2-tutorial
Tutorials for building a PySC2 bot
tjuHaoXiaotian/reinforce
reinforcement learning
tjuHaoXiaotian/s2client-api
StarCraft II Client - C++ library supported on Windows, Linux and Mac designed for building scripted bots and research using the SC2API.
tjuHaoXiaotian/angular-tutorial-damoqiongqiu
tjuHaoXiaotian/Articles
tjuHaoXiaotian/b-suitors
My implementation of the parallel b-suitors algorithm, created for the Concurrent Programming course at the University of Warsaw (2017/2018)
tjuHaoXiaotian/DeepRL-Agents
A set of deep reinforcement learning agents implemented in TensorFlow.
tjuHaoXiaotian/dm_control
The DeepMind Control Suite and Package
tjuHaoXiaotian/Font-Awesome
The iconic font and CSS toolkit
tjuHaoXiaotian/HFO
Half Field Offense in Robocup 2D Soccer
tjuHaoXiaotian/makeSimple
Classic algorithms implemented for the "modal programmation efficace" (efficient programming) course at École Polytechnique, Palaiseau
tjuHaoXiaotian/pycolab
A highly-customisable gridworld game engine with some batteries included. Make your own gridworld games to test reinforcement learning agents!
tjuHaoXiaotian/reinforcement-learning-code
tjuHaoXiaotian/Reinforcement-learning-with-tensorflow
Simple reinforcement learning tutorials
tjuHaoXiaotian/Reinforcement_Learning_Blog
Code written to accompany the blog posts
tjuHaoXiaotian/Tensorflow-Tutorial
TensorFlow tutorial, from basic to advanced
tjuHaoXiaotian/TJU-AI-LabEnv
tjuHaoXiaotian/wechat-jump-source
Source code of the WeChat mini-game Jump Jump (跳一跳)
tjuHaoXiaotian/Wechat_AutoJump
Automatically plays the WeChat mini-game Jump Jump (跳一跳)
tjuHaoXiaotian/wechat_jump_game
Python helper for the WeChat game Jump Jump (跳一跳)
tjuHaoXiaotian/WGAN-GP-tensorflow
TensorFlow implementation of the paper "Improved Training of Wasserstein GANs"
tjuHaoXiaotian/WordCloud
Word cloud (JS)