tjuHaoXiaotian

tju student

tjuHaoXiaotian's Stars

youngyangyang04/leetcode-master
《代码随想录》LeetCode 刷题攻略：200道经典题目刷题顺序，共60w字的详细图解，视频难点剖析，50余张思维导图，支持C++，Java，Python，Go，JavaScript等多语言版本，从此算法学习不再迷茫！🔥🔥 来看看，你会发现相见恨晚！🚀
Language:Shell49.3k 379 22411.1k
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language:Python8.4k 60 1.4k1.6k
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
Language:Python7.6k 93 7241.1k
PaddlePaddle/models
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
Language:Python6.9k 270 2k2.9k
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
3.9k 240 9717
MLNLP-World/Paper-Writing-Tips
MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips
3.4k 45 3436
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Language:Jupyter Notebook3.3k 78 1555
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Language:Python2.8k 21 196353
seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Language:Python2.8k 49 39456
google-deepmind/mctx
Monte Carlo tree search in JAX
Language:Python2.3k 30 47178
openai/neural-mmo
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
Language:Python1.6k 190 30263
starry-sky6688/MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
Language:Python1.4k 13 111277
sudharsan13296/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Language:Jupyter Notebook832 44 3324
Johnson0722/CTR_Prediction
CTR prediction using FM FFM and DeepFM
Language:Python744 21 23299
shariqiqbal2810/MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
Language:Python649 7 38169
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python583 16 40113
2019ChenGong/Machine-Learning-Notes
白板推导系列课程笔记初版
489 10 5106
devsisters/pointer-network-tensorflow
TensorFlow implementation of "Pointer Networks"
Language:Python469 24 15138
Theohhhu/UPDeT
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)
Language:Python125 3 1716
tjuHaoXiaotian/pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
Language:Python116 3 910
wouterkool/stochastic-beam-search
Implementation of Stochastic Beam Search using Fairseq
Language:Python95 7 05
wendelinboehmer/dcg
Language:Python68 5 123
TJU-DRL-LAB/Multiagent-RL
The official code releasement of publications in MARL field of TJU RL lab.
Language:Python41 7 16
tjuHaoXiaotian/GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
Language:Python31 2 26
tjuHaoXiaotian/ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Language:Python26 3 18
xtof-durr/makeSimple
algorithmes classiques implémentés dans le cadre du cours modal programmation efficace à l'Ecole Polytechnique, Palaiseau
Language:Java20 2 214
tjuHaoXiaotian/SC1
Language:Python18 5 45
tjuHaoXiaotian/Qfamily_for_MatrixGame
We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.
Language:Python14 2 00
CNDOTA/NeurIPS22-ATM
Language:Python10 2 13
google-research/unique-randomizer
UniqueRandomizer is a data structure for sampling outputs of a randomized program, such as a neural sequence model, incrementally and without replacement.
Language:Python8 6 03

tjuHaoXiaotian

tjuHaoXiaotian's Stars

youngyangyang04/leetcode-master

DLR-RM/stable-baselines3

thu-ml/tianshou

PaddlePaddle/models

LantaoYu/MARL-Papers

MLNLP-World/Paper-Writing-Tips

higgsfield-ai/higgsfield

opendilab/DI-engine

seungeunrho/minimalRL

google-deepmind/mctx

openai/neural-mmo

starry-sky6688/MARL-Algorithms

sudharsan13296/Hands-On-Reinforcement-Learning-With-Python

Johnson0722/CTR_Prediction

shariqiqbal2810/MAAC

hijkzzz/pymarl2

2019ChenGong/Machine-Learning-Notes

devsisters/pointer-network-tensorflow

Theohhhu/UPDeT

tjuHaoXiaotian/pymarl3

wouterkool/stochastic-beam-search

wendelinboehmer/dcg

TJU-DRL-LAB/Multiagent-RL

tjuHaoXiaotian/GASIL

tjuHaoXiaotian/ICML-2020-MSBCB

xtof-durr/makeSimple

tjuHaoXiaotian/SC1

tjuHaoXiaotian/Qfamily_for_MatrixGame

CNDOTA/NeurIPS22-ATM

google-research/unique-randomizer