tjuHaoXiaotian

tju student

Pinned Repositories

baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python1 2 00
Deep-Policy-Gradient
Use basic deep reinforcement learning to solve Doom health gathering environment
Language:Jupyter Notebook1 2 00
deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
Language:Python1 2 00
GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
Language:Python31 2 26
ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Language:Python27 3 19
MA-MuZero
MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampled-MuZero, from "Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces" at AAAI 2024.
11 5 20
pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
Language:Python123 3 1112
Qfamily_for_MatrixGame
We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.
Language:Python14 2 00
RL_paper
5 4 02
SC1
Language:Python19 5 45

tjuHaoXiaotian's Repositories

tjuHaoXiaotian/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python1 2 00
tjuHaoXiaotian/Deep-Policy-Gradient
Use basic deep reinforcement learning to solve Doom health gathering environment
Language:Jupyter Notebook1 2 00
tjuHaoXiaotian/deepmind_MAS_enviroment
some Multiagent enviroment in 《Multi-agent Reinforcement Learning in Sequential Social Dilemmas》 and 《Value-Decomposition Networks For Cooperative Multi-Agent Learning》
Language:Python1 2 00
tjuHaoXiaotian/DQfD
An implement of DQfD（Deep Q-learning from Demonstrations) raised by DeepMind:Learning from Demonstrations for Real World Reinforcement Learning
Language:Python10
tjuHaoXiaotian/dqn-hfo
Language:C++1
tjuHaoXiaotian/improved_wgan_training
Code for reproducing experiments in "Improved Training of Wasserstein GANs"
Language:Python1 2 0
tjuHaoXiaotian/maddpg
Language:Python1 2 0
tjuHaoXiaotian/pysc2-examples
StarCraft II - pysc2 Deep Reinforcement Learning Examples
Language:Python1 2 0
tjuHaoXiaotian/pysc2-tutorial
Tutorials for building a PySC2 bot
Language:Python1
tjuHaoXiaotian/reinforce
reinforcement learning
Language:Jupyter Notebook1 2 0
tjuHaoXiaotian/s2client-api
StarCraft II Client - C++ library supported on Windows, Linux and Mac designed for building scripted bots and research using the SC2API.
Language:C++1
tjuHaoXiaotian/angular-tutorial-damoqiongqiu
Language:CSS2 0
tjuHaoXiaotian/Articles
tjuHaoXiaotian/b-suitors
My implementation of parallel b-suitors algorithm created for Concurrent Programming course at University of Warsaw (2017/2018)
Language:C++
tjuHaoXiaotian/DeepRL-Agents
A set of Deep Reinforcement Learning Agents implemented in Tensorflow.
Language:Jupyter Notebook2 0
tjuHaoXiaotian/dm_control
The DeepMind Control Suite and Package
Language:Python
tjuHaoXiaotian/Font-Awesome
The iconic font and CSS toolkit
Language:HTML2 0
tjuHaoXiaotian/HFO
Half Field Offense in Robocup 2D Soccer
Language:C++
tjuHaoXiaotian/makeSimple
algorithmes classiques implémentés dans le cadre du cours modal programmation efficace à l'Ecole Polytechnique, Palaiseau
tjuHaoXiaotian/pycolab
A highly-customisable gridworld game engine with some batteries included. Make your own gridworld games to test reinforcement learning agents!
Language:Python2 0
tjuHaoXiaotian/reinforcement-learning-code
Language:Python
tjuHaoXiaotian/Reinforcement-learning-with-tensorflow
Simple Reinforcement learning tutorials
Language:Python2 0
tjuHaoXiaotian/Reinforcement_Learning_Blog
This code is written for the blogs
Language:Python2 0
tjuHaoXiaotian/Tensorflow-Tutorial
Tensorflow tutorial from basic to hard
Language:Python
tjuHaoXiaotian/TJU-AI-LabEnv
Language:Jupyter Notebook2 0
tjuHaoXiaotian/wechat-jump-source
跳一跳源代码微信小游戏
Language:Python2 0
tjuHaoXiaotian/Wechat_AutoJump
自动玩微信小游戏跳一跳
Language:Python2 0
tjuHaoXiaotian/wechat_jump_game
python 微信《跳一跳》辅助
Language:Python2 0
tjuHaoXiaotian/WGAN-GP-tensorflow
Tensorflow Implementation of Paper "Improved Training of Wasserstein GANs"
Language:Python
tjuHaoXiaotian/WordCloud
Words Cloud (js)
Language:HTML1