tjuHaoXiaotian's Stars
youngyangyang04/leetcode-master
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
PaddlePaddle/models
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
MLNLP-World/Paper-Writing-Tips
MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips
higgsfield-ai/higgsfield
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
seungeunrho/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
google-deepmind/mctx
Monte Carlo tree search in JAX
openai/neural-mmo
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
starry-sky6688/MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
sudharsan13296/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Johnson0722/CTR_Prediction
CTR prediction using FM FFM and DeepFM
shariqiqbal2810/MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
2019ChenGong/Machine-Learning-Notes
白板推导系列课程笔记 初版
devsisters/pointer-network-tensorflow
TensorFlow implementation of "Pointer Networks"
Theohhhu/UPDeT
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)
tjuHaoXiaotian/pymarl3
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superior performance on SMAC-V2.
wouterkool/stochastic-beam-search
Implementation of Stochastic Beam Search using Fairseq
wendelinboehmer/dcg
TJU-DRL-LAB/Multiagent-RL
The official code releasement of publications in MARL field of TJU RL lab.
tjuHaoXiaotian/GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
tjuHaoXiaotian/ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
xtof-durr/makeSimple
algorithmes classiques implémentés dans le cadre du cours modal programmation efficace à l'Ecole Polytechnique, Palaiseau
tjuHaoXiaotian/SC1
tjuHaoXiaotian/Qfamily_for_MatrixGame
We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.
CNDOTA/NeurIPS22-ATM
google-research/unique-randomizer
UniqueRandomizer is a data structure for sampling outputs of a randomized program, such as a neural sequence model, incrementally and without replacement.